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TITLE OF THE INVENTION 

High speed virtual Machine and Coapiler 

BACKGROUND OF THE INVENTION 

(1) Field of the Invention 

The present invention relates to virtual machines and 
to virtual machine compilers. m particular, the invention 
relates to a technique for increasing the execution speed of 
virtual machines. 

(2) Des cription of the Prior Art- 
Standard Virtual Machine 

Virtual machines are used to have a same program 
executed by computers, such as personal computers and 
workstations, that include different types of CPU. virtual 
machines are useful in the field of communications, 
especially on a network to which different types of 
computers are connected, since they can overcome the 
differences in CPU architecture between computers and so 
allow the efficient and high-speed use of shared resources. 
Note that in this specification, CPUs are called "real 
machines". 

A virtual machine is a virtual processor, which is to 
say, a processor achieved by executing software. A virtual 
machine decodes and executes executable programs 
(hereinafter referred to as "virtual machine programs" or 
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"virtual machine instruction sequences") that are sequences 
of instructions (hereinafter, "virtual machine 
instructions") specific to the virtual machine. Virtual 
machines are normally realized by programs (hereinafter, 
"real machine programs" or "real machine instruction 
sequences" composed of instructions (hereinafter, "real 
machine instructions") specific to a target real machine on 
which the virtual program is to be run. Maintaining a high 
execution speed is a central issue for virtual machines, so 
that many virtual machines have a stack architecture. 

One example of conventional virtual machines are the 
JAVA (trademark) virtual machines developed by SUN 
MICROSYSTEMS, INC. 

Fig. 1 is a block diagram showing a construction of a 
conventional virtual machine 4400 with a stack architecture, 
such as a JAVA virtual machine. The virtual machine 4400 
compr i—r «->, e instruction storing unit 4401, the decoding 
unit 4402, the executing unit 4410, and the stack 4420. The 
instruction storing unit 4401 stores a virtual machine 
program to be executed. The decoding unit 44 02 reads and 
decodes a virtual machine instruction. The execution unit 
4410 executes operations according to the decoded data 
produced by the decoding unit 4402. The stack 4420, which 
is a LIFO (last-in first-out) memory area, temporarily 
stores data used in the processing of the execution unit 
4410. in Fig. 1, solid lines show the data flows, while 
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dotted lines show the control flows. 

decoding unit 4402 includes the decode table 
4406, the program counter (PC) 4404, the instruction reading 
unit 4403, and the search unit 4405. The decode table 4406 
stores data, such as jump addresses of microprograms (stored 
in the executing unit 4410) that correspond to all of the 
virtual machine instructions that can be executed by the 
virtual machine 4400 with a stack architecture. The program 
counter (PC) 4404 holds the address of the next instruction 
to be read from the instruction storing unit 4401. The 
instruction reading unit 4^03 reads this next instruction. 
The search unit 4405 refers to the decode table 4406 to find 
a jump address corresponding to the read instruction and 
output_ jump address to the execution unit 4410. In 

this specification, a microprogram is a real machine program 
that corresponds to a virtual machine instruction. 

The executing unit 4410 includes a microprogram 
storing unit 4411 and a stack pointer (SP) 4412. The 
microprogram storing unit 4411 stores microprograms, which 
are real machine programs corresponding to virtual machine 
instructions, in advance at locations indicated by jump 
addresses. The stack pointer (SP) 4412 indicates the 
address at the top of the stack 4420. 

Fig. 2 is a table for describing the instruction set 
of the virtual machine 4400. in Fig. 2 , all of the virtual 
machine instructions that the virtual machine 4400 can 
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decode and execute are shown in mnemonic form, along with 
the operation content of each instruction, changes in the 
content of the stack 4420 caused by each instruction, and 
the value of the SP 4412 after execution. In Fig. 2, the 
6 legend "sO" indicates the value at the top of the stack 

4420, while "si" indicates the second highest value. as one 
example, the notation "sp^sO+sl" f0r the virtual machine 
instruction "Add" denotes that the value at the top of the 
stack is set equal to a sum of the top and second highest 
CjlO values of the stack before execution. The notation "sp^- 
j| Sp ~ 1 " denotes that the height of the stack decreases by one 

H due to the execution of the "Add" instruction, 

p Fig. 3 shows the stored contents of the decode table 

W 4406 shown in Fig. 1. This decode table 4406 includes 

opcodes 4406a that indicate the operation types of virtual 
machine instructions, jump addresses 4406b which are the 
addresses of microprograms in the microprogram storing unit 
4411 that correspond to these virtual machine instructions, 
and numbers of operands 4406c that show the number of 
operands in each virtual machine instruction. Here, each 
opcode is set as 1-byte long, and operands are counted in 
one-byte units. Virtual machine' instructions, which may 
include only an opcode or only an operand, that are 
represented by a physical bit pattern are hereinafter 
referred to as "virtual machine code". 

Figs. 4A-4D show examples of the microprograms stored 
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in the microprogram storing unit 4411 in Fig. 1. The 
microprograms in Pigs. 4A-4C respectively correspond to the 
virtual machine instructions "Push", "Add" , and "Mult", 
while the microprogram in Pig. 4D shows a microprogram that 
5 forms the common latter part of each of the microprograms in 

Figs. 4A-4C. This microprogram in Fig. 4D is a real machine 
program for jumping to the next virtual machine instruction. 
The operation contents of the real machine instructions in 
these microprograms are shown in Fig. 5. The virtual 

|10 machine 4400 itself is realized by a real machine that can 

U decode and execute the rea^machine instructions shown in 

Fig. 5. Note that the PC 4404 is physically realized by 

* register #2 (r2) of the real machine, and the SP 4423 by 

0 register #3 (r3) . 

Fig. 6 is a flowchart showing the processing of 
decoding unit 4404 shown in Fig. 1. The instruction reading 
unit 4403 is instructed by the execution unit 4410 via a 
signal line R to read the next instruction (steps 4502-4503) 
and so reads the virtual machine instruction with the 
address stored in the PC 4404 from the instruction storage 
unit 4401 (steps 4504-4505). Following this, search unit 
4405 refers to the decode table." 4406 to find a jump address 
and operands corresponding to the read virtual machine 
instruction, outputs the jump address and operands (if any) 
to the executing unit 4410 as decoded data (step 4506), and 
gives the executing unit 4410 a "read end" notification via 
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the signal line R (step 4507). This "read end" notification 
marks the completion of decoding for one virtual machine 
instruction. 

Fig. 7 is a flowchart showing the processing in step 
4506 in detail. The search unit 4405 compares 1-byte of 
virtual machine code (the opcode) read by reading 44 03 with 
one opcode 4406a in decode table 4406 at a time until a 
match is found (steps 4802-4807). The search unit 4405 then 
reads the jump address 4406b and the number of operands 
4406c corresponding to the matching opcode 4406a from the 
decode table 4406. The search unit 4405 outputs the read 
jump address 4406b to the executing unit 4410 (step 4808), 
has the instruction reading unit 4403 read as many operands 
as are indicated by the number of operands 4406c from the 
instruction storing unit 4401, and outputs the operands to 
execution unit 4410 (steps 4809-4813) . 

The flowcharts of Figs. 6 and 7 show the processing 
when decoded data sent from the decoding unit 4402 Is 
directly transferred to the executing unit 4410. The 
flowchart in Fig. 8 shows the case when the decoded data is 
transferred to the executing unit 4410 via a buffer that is 
capable of storing setS of decoded data. In this latter 
case, the reading of virtual machine instructions from the 
instruction storing unit 4401 and the subsequent decoding 
may be performed independently 'of the execution by the 
executing unit 4410 and repeated as long as there is space 
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in thi, jJJci (steps 4605-4613) « 

Fig. 9 shows the processing of executing unit 4410 in 
Fig, 1. The executing unit 4410 initializes SP 4412 and PC 
4404 (step 4702) and repeats the processing described below 
for each virtual machine instruction (steps 4703-4707) . 
That is, the executing unit 4410 instructs the instruction 
reading unit 4403 via the signal line R to read the next 
virtual machine instruction (step 4703) * The executing unit 
4410 then reads decoded data transmitted from the search 
unit 4405, jumps to a jump address that is included in the 
decoded data and that specifies a microprogram stored in the 
microprogram storing unit 4411,, the microprogram 
corresponding to the read virtual machine instruction, and 
executes the microprogram until the executing unit 4410 
receives a "read end" notification via the signal line R 
(steps 4704-4707) . 

Fig. 10A shows a sample program for describing a 
specific example of the processing of the virtual machine 
4400, In this example, instruction storing unit 4401 stores 
a virtual machine program for calculating the arithmetic 
expression "2* (3+4)" shown in Fig. 10B. 

Fig. 10C shows the decoded data that is sequentially 
outputted from the decoding unit 4402 when the virtual 
machine program shown in Fig, lOA is decoded and executed by 
the conventional virtual machine 4400* The decoding unit 
4402 successively outputs jump addresses and the necessary 
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operands corresponding to the decoded virtual machine 
instructions as decoded data to the executing unit 4410. 

Figs. 11A and 11B show the states of the PC 4404, the 
SP 4412, and the stack 4420 before and after the execution 
of the each virtual machine instruction when the executing 
unit 4410 executes the virtual machine program shown in Fig. 
10A in accordance with the decoded data sequences shown in 
Fig. IOC. These figures show the processing of the virtual 
machine program split into a former and a latter part. 
Here, rv.' indicates the address of the next virtual 

machine instruction to be executed in the virtual machine 
program. The addresses of virtual machine instructions are 
the numbers shown to the left of the virtual machine 
instructions in Fig. 10A. The initial value of the PC 4404 
is "1". The SP 4412 indicates the top of stack 4420, and so 
marks a position at which an:item was most recently stored 
or read. The initial value of SP 4412 is and indicates 

that the stack 4420 is empty. As can be understood from 
Figs. 11A and 11B, the calculation of the arithmetic 
expression "2* (3+4)" is completed when PC 4404 indicates 

if g ii 

The major problem for conventional virtual machines 
like virtual machine 4400 is how to increase execution 
speed. Processes such as the decoding of virtual machine 
instructions generate overheads, so that virtual machines 
end up operating at a much slower speed than when an 
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equivalent real machine program is directly executed by a 
real machine. To improve the performance speed of virtual 
machines, the following methods have been proposed. 

5 First Conventional Technique 

In this first conventional technique/ the storage 
area at the top of the stack (TOS) is assigned not to memory 
but to a specified register of a real machine. Hereinafter, 
such a storage area is called the TOS variable (See pp315- 
jlO 327 "PLDI" (1995), ACM). 

U Figs. 12A-12D are* microprograms corresponding to the 

I principal virtual machine instructions that are stored in a 

V microprogram storage unit of a virtual machine based on this 

jj first conventional technique. These figures correspond to 

Figs. 4A-4D that were used to describe the virtual machine 
4400. This example uses the : following physical mapping. The 
TOS variable is assigned to register #0 (rO) of the real 
machine and, as in Figs. 4A-4D, PC 4404 to register #2 (r2), 
and SP 4421 to register #3 (r3) . 

Figs. 13A and 13B show the changes in the states of 
the PC 4404, the SP 4412, the TOS variable 4421, and the 
memory stack 4422 (the part of the stack 4420 that is 
allocated to memory) as a virtual machine provided with the 
microprograms shown in Fig. 12A-12D executes the virtual 
machine program shown in Fig. 10A. These figures shows the 
processing split into a former and a latter part and 
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correspond to the Figs. Ha and llB that were used to 
describe the operation of the virtual machine 4400. As 
before, the calculation of the arithmetic expression 
"2* (3+4)" is completed in Figs. 13A and 13B when the PC 4404 
indicates "9". 

As can be seen by comparing Figs. 12A-12D with Figs. 
4A-4D, the first conventional technique makes fewer accesses 
to the memory. When the virtual machine 4400 executes a 
virtual machine instruction such as an addition "Add" or a 
multiplication "Mult", two reads and one write are performed 
for the stack 4420/ makincp/a- total of three memory accesses 
for one virtual machine instruction, with the first 
conventional technique, the assigning of the TOS variable to 
a register enables the same instruction to be executed with 
only one, access to the memory stack 4422. This results in 
the execution speed being increased in proportion to the 
reduction in the number of memory accesses. 

Second Conventional Technicrue 

A second conventional technique uses a "native 
coding" method, in which a predetermined part of a virtual 
machine programs is written in real machine instructions and 
is directly executed by a real machine. As a result, 
identifiers are used to indicate that such predetermined 
part is written using real machine instructions. 

As one example, a JAVA virtual machine can store the 
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constant name "ACC_NATIVE" (256) into an access flag (such 
as the 16-bit flag "access_flags" that forms part of the 
"method_info" structure) of a class file that includes a 
virtual machine program to show that part of the program is 
written in real machine instructions (see the Java Bytecodes 
and the JAVA virtual Machine Specification, 1995 editions, 
produced by SUN MICROSYSTEMS, INC.). 

In this way, this second conventional technique 
improves execution speed by having the real machine directly 
execute a predetermined part of a program. 

» " '■ ■ 

Third C op ventional Technique 

A third conventional technique uses a "just-in-time" 
(JIT) compiler that compiles parts of a virtual machine 
program as required during execution. Here, compiling 
refers to the replacement of- virtual machine instructions 
with real machine instructions (see Laura Lemay et al., Java 
Gengc Nyumon (An Introduction to JAVA), Prentice Hall, 1996, 
and Laura Lemay and Charles L . Perkins, Teach yourself JAVA 
in 21 days) . Virtual machines that use a JIT compiler have 
the real machine directly execute compiled parts of a 
virtual machine program, and so - increase the overall 
execution speed of virtual machine programs. 

Fourth Conventional Technique ' 

A fourth conventional technique is used when 
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computers on a network execute virtual machine programs that 
they download from a server computer. In this technique, 
the code in a virtual machine program is compressed 
beforehand using LZ (Lempel-Zif ) methods or Huffman coding 
to reduce the time taken by file transfer (see Japanese 
Laid-open Patent Application H07-121352 or H08-263263) . 

With this technique, an increase in execution speed 
can be obtained if the time taken to transfer the virtual 
machine i^ogram forms a large part of the overall processing 
time required to execute the virtual machine program. 

^ . - 

The first to fourth conventional techniques described 
above have the following problems. 

Problem s with the First Conventional Technique 

The first conventional technique, where the TOS 
variable is allocated to a register of a real machine, has a 
drawback in that it is not suited to real machines with 
superscalar architecture that have become increasingly 
inexpensive in recent years. This means that the 
improvements in the execution speed for a superscalar real 
machine (hereinafter, "superscalar machine") are relatively 
small when compared with the improvement for a standard real 
machine (hereinafter called a "standard machine") that is 
incapable of parallel processing. This is described in more 
detail below. 
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The following describes the standard operation and 
notation of a pipeline used by a register machine, such as 
a superscalar machine or a standard machine, with reference 
to Figs. 14-22. 

Fig. 14 shows the mnemonics used to indicate each 
stage included in the pipeline. The superscalar machine and 
a standard machine described below are assumed to each have 
a pipeline containing the five stages shown in this figure. 

Fig. 15 shows the ideal pipeline flow for a standard 
machine, in this example, four real machine instructions 
are sequentially process©* with each pipeline stage taking 
exactly one clock cycle. Each pipeline stage is performed 
in parallel for a different real machine instruction so that 
as the long-term average, one instruction is executed in one 
clock cycle. 

Fig. 16 shows an ideal pipeline flow for a 
superscalar machine. This superscalar machine has two 
separate pipelines. In Fig. 16, two real machine • 
instructions are executed in one clock cycle as the long- 
term average, giving the superscalar machine a throughput 
twice that of the standard machine. 

Fig. 17 shows a pipeline flow for a standard machine 
when pipeline hazards occur. Here, instruction B uses the 
execution result of instruction' A, which is to say, 
instruction B has a true dependency (also called a data 
dependency) on the preceding instruction A. Since the 
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execution result of instruction A cannot be obtained until 
the memory access stage mem is completed, the execution of 
instruction B is delayed, which causes the hazard as shown 
by "-" in the figure. 

When the processing of an instruction is delayed in a 
real machine with a pipeline structure/ the processing of 
the following instructions is also delayed. This is shown 
in Fig. 17, where the processing of instruction C, which 
follows instruction B, is also delayed. 

Fig. 18 shows a pipeline flow for a superscalar 
machine when pipeline hazards occur. Here, instruction Bl 
has a true dependency on the preceding instructions Al and 
A2. Here, the reason that a pipeline hazard occurs in the 
fifth clock cycle for the instruction C2 is that the two 
processing units (arithmetic logic units or "ALUs") provided 
in the processor are busy with the execution of the 
preceding instructions Bl and CI. This means that 
instruction C2 cannot be executed in that cycle. 

Figs. 19 and 20 correspond to Figs. 17 and 18, and 
show pipeline flows when two clock cycles need to pass 
before values obtained through memory access (MEM) can be 
used. In reality, in most real 'machines, obtaining a value 
from the primary cache takes two clock cycles. Note that 
obtaining a value from the secondary cache takes more clock 
cycles. 

Figs. 21 and 22 respectively show pipeline flows for 
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a standard machine and superscalar machine when instructions 
Al and A2 are instructions that indicate a jump destination 
using a register. The jump destinations of these 
instructions are not known until the register reference 
stage (RF) is completed, so that the succeeding instructions 
B, Bl, and B2 that are fetched as per normal during the 
register reference operation are canceled (as shown by the 
"x" in Figs. 21 and 22) in the third clock cycle following 
the RF stages. 

The following describes the specific problems of a 
superscalar machine and a yeal machine of the first 
conventional technique, with reference to Figs. 23-26. 

Figs. 23-2 6 show pipeline flows when the virtual 
machine of the first conventional technique is realized by a 
real machine executing the virtual machine program shown in 
Fig. 10A. in detail, these figures show the pipeline flow 
for the latter part (the jump processing shown in Fig. 12D) 
of the microprogram (of Fig. 12A) with the address -7 that 
corresponds to the virtual machine instruction "Add" and the 
pipeline flow for the former part (the multiplication 
processing) of the microprogram (of Fig. 12C) with the 
address 8 that corresponds to the virtual machine 
instruct™ "Mult". Figs. 23 and 24 respectively show the 
pipeline flows for a standard machine and a superscalar 
machine where one clock cycle needs to pass before a value 
read during a memory access can be used, while Figs. 25 and 
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26 respectively show the pipeline flows for a standard 
machine and a superscalar machine where two clock cycles 
needs to pass before a value read during a memory access can 
be used. 

5 This series of microprograms shown in Figs. 12D and 

12A contain two significant true dependencies. The first is 
in the microprogram for jump processing shown in Fig. 12D 
corresponding to the virtual machine instruction "Add", and 
exists between the instruction "Load" for reading a jump 
JS! 10 address and the instruction "Jump" for jumping to the 

8Uj address. The second is in> the microprogram shown in Fig. 

m 12C corresponding to the virtual machine instruction "Mult" 

m f or multiplication processing and exists between the 

instruction "Load" for reading a variable from the memory 
stack and the instruction "Mult" for multiplication 
processing. 

In the pipeline shown in Fig. 23, the first data 
dependency is absorbed by the real machine instruction "Ine" 
that is inserted between the instructions "Load" and "Jump". 
The sec: -.2 data dependency is absorbed by the real machine 
instruction "Dec" that is inserted between the instructions 
"Load" and "Mult". The processing in this pipeline is only 
disturbed by the cancellation of one instruction that is 
necessitated by the execution of the real machine 
instruction " jmp" . as a result, the entire procedure is 
completed in 11 cycle clocks. 
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In the pipeline shown in Fig. 24, the first and 
second data dependencies are not absorbed. As a result, the 
processing in these pipelines is disturbed at three points. 
The first disturbance is the hazard in the fourth clock 
cycle caused by the first data dependency, the second is the 
cancellation of five instructions necessitated by the 
execution of real machine instruction «ja$>«, an d the third 
is the hazard in the eighth clock cycle caused by the second 
data dependency. As was the case with Fig. 24, the entire 
procedure is completed in 11 clock cycles in Fig. 23. 

As in Fig. 24, the> above first and second data 
dependencies are not absorbed in the pipeline shown in Fig. 
25, so that the processing in this pipeline is disturbed at 
three points. The first disturbance is the hazard in the 
fifth clock cycle caused by the first data dependency, the 
second is the cancellation of one instruction necessitated 
by the execution of the real machine instruction "Jtap", and 
the third is the hazard in the tenth clock cycle caused by 
the second data dependency. The entire procedure is 
completed in 13 clock cycles. 

As in Fig. 24, the above first and second data 
dependencies are not absorbed in the pipeline shown in Fig. 
2 6, so that the processing is disturbed at three points. 
The first disturbance is the hazards caused in the fourth 
and fifth clock cycles by the first data dependency, the 
second is the cancellation of seven instructions 
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necessitated by the execution of the real machine 
instruction "Ota*)" , and the third is the hazards caused in 
the ei^V* and tenth clock cycles by the second data 
dependency. As in Fig. 25, the entire procedure is 
5 completed in 13 clock cycles. 

Considering that the processing shown in either of 
Figs. 23 and 24 requires 11 clock cycles and that the 
processing shown in either of Figs. 25 and 26 requires 13 
clock cycles, it is clear that there is no difference in 
^10 execution time between a standard machine and a superscalar 

jfUj machine for this first conventional technique. This means 

that no advantage is gained from using a superscalar machine 
•p capable of parallel processing. 

7 In this w *y, this first conventional technique causes 

a large l^op in the processing efficiency of a superscalar 
machine. Another drawback is the lack of provisions for 
exception handling, such as for errors, or interrupt 
handling, which is required for debugging. 

As a result, a virtual machine that uses this first 
conventional technique needs to detect an interrupt state 
and to perform interrupt handling every time the machine 
executes a virtual machine instruction. This means that 
another memory access (i.e., data transfer of a variable in 
the memory that indicates an interrupt state into a 
register) is required every time a virtual machine 
instruction is executed. This cancels out the advantage of 
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this first conventional technique, wherein assigning the TOS 
variable to a register reduces the number of memory 
accesses, so that the overall execution speed is not 
improved. 

Problems with the Second Conventional Technique 

The second conventional technique, which is to say 
the use of native coding, has a problem in that it is 
difficult to provide common virtual machine programs to real 
machines with different architectures. This is because part 
of the virtual machine program is written in real machine 
instrui-w^wns for a specific type of real machine. As a 
result, when a virtual machine program is to be provided on 
a network for common use by five types of computers with 
different real-machine architectures, it becomes necessary 
to provide real machine programs of all five real machines. 

Since there are also differences in system 
configuration between computers, there is no guarantee that 
real machine instructions will have a faster execution speed 
than virtual machine instructions, even for real machines 
with the same architecture. As one example, if programs are 
written for RISC (Reduced Instruction Set Computers) type 
real machines where code size is generally large, the use of 
insufficient memory will lead to frequent page swapping 
between main- and virtual memory when virtual machine 
instructions are replaced with real machine instructions. 
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This reduces the overall execution speed. 

Problems with the Third Conventional Technique 

The third conventional technique, which uses a JIT 
compiler, has a problem in that the compiling of the virtual 
machine program can take a long time. The reasons for this 
are explained below* 

A first reason is that the processing must satisfy 
the specific restrictions of the target real machine 
concerning jump destinations. As one example, when the 
target machine has a resti^Lction that the address of a jump 
destination must be within word (basic word length) 
boundaries in the main memory, simple conversion of the 
virtual machine instructions to corresponding real machine 
instructions will result in a violation of this restriction. 

Fig, 27 is a program: list for a sample virtual 
machine program for explaining this first reason. Fig. 28 
is a flowchart for this sample virtual machine program. 

The present virtual machine program calculates the 
total of ten integers from zero to nine. It is composed of 
a setting of initial values (step 7002, Addresses 0-6), 
judgment of the end of caiculation (step 7003, Addresses 
8-13), addition and setting of the next value to be added 
(step 7004, Addresses 15-29), and end processing (step 7005, 
Address 31) . 

Fig, 29 is a conversion table that is used when 
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compiling this virtual machine program according to this 
third conventional technique. This conversion table is a 
correspondence table that associates virtual machine 
instructions with the real machine programs into which they 
are to be converted. Note that for reference purposes, the 
conversion table in Fig. 29 also shows the code size of each 
real machine program- 
Fig. 30 shows the code arrangement of the real 
machine program that is obtained when the sample virtual 
machine program shown in Fig. 27 is compiled using the 
conversion table shown in fig. 29. In Fig. 30, relative 
addresses in original virtual machine program are given for 
each real machine program to show the correspondence between 
the real machine program and the virtual machine program. 

If the target real machine has a restriction whereby 
only jump destinations complying with a two-word alignment 
can be indicated, it can be seen from Fig. 30 that the 
virtual machine instruction "Stop" with address 31 -that is 
the jump destination indicated by the virtual machine 
instruction "Brz" at address 13 is arranged at odd-numbered 
addresses in the real machine program. Since this address 
does not correspond to the two-word alignment, this branch 
instruction violates the restrictions concerning jump 
destinations. As a result, processing that rectifies this 
violation needs to be performed. 

A second reason for the above problem is that special 
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processing that accompanies branches can be necessary for 
the target real machine. Some CPUs with RISC architecture, 
such as CPUs with SPARC (Registered Trademark) architecture 
produced by SPARC INTERNATIONAL, INC. and CPUs produced by 
MIPS TECHNOLOGIES, INC., have special rules that are used 
when executing a number of instructions located after a 
branch instruction. Specific examples of these rules are 
the execution of a specific succeeding instruction 
regardless of whether a branch is performed (called a 
"delayed branch") or the execution of a specific succeeding 
instruction only when a branch is performed (called a 
"canceling branch") . 

When the target real machine is of this type, special 
processing needs to be performed, such as scheduling that 
analyzes the instructions and changes their order or the 
insertion of no operation instructions (such as NOP codes) 
directly after branch instructions. 

Problems with the Fourth Conventional Technique 

The fourth conventional technique, which is to say 
the compression of virtual machine programs in advance, has 
a problem in that there is no resolving means for dealing 
with problems that occur due to the execution of branch 
instructions in the compressed "virtual machine program. 

Fig. 31A shows a compression table for explaining 
this problem. This compression table associates variable- 
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length codes 9300a with virtual machine instructions 9300b. 
Fig. 31B is example code that is obtained by encoding the 
virtual machine instruction sequence A using the compression 
table shown in Fig. 31A. 
5 If the example code shown in Fig. 31B is decoded 

starting from the first bit, the original virtual machine 
instruction A ("babe") will be obtained. However, when the 
execution flow moves to point B in Fig, 31B due to a branch 
instruction, decoding the code sequence "0010110" that 
^0 starts at point B using the compression table in Fig. 31A 

jlJ gives the mistaken virtual* machine instruction "aabc". 

;JfL Problems Common to the First-Fourth Conventional T echniques 

^ The first-fourth conventional techniques described 

015 above have a common problem in that none of them is able to 

Q raise the efficiency of cache processing. As a result, the 

ijS market is still waiting for the realization of a high-speed 

virtual machine that makes full use of the processing power 
of real machines and computers that are equipped with a 
20 cache memory. 

Fig. 32 is a block diagram showing the program 
counter 6901 and the instruction cache 6902 of a virtual 
machine- This drawing will be used to explain the problems 
that can occur for a virtual machine that is equipped with a 
25 cache memory. 

The instruction cache 6902 is equipped with a cache 
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table 6904 that stores addresses for specifying each cache 
block in the cache memory, where a cache block is an 
instruction sequence 6903 composed of the data in ten 
consecutive addresses- Fig, 33 shows the case where the 
5 sample virtual machine program shown in Fig. 27 is stored in 

the cache memory, with the boundary lines A, B, and C 
marking the boundaries between the cache blocks. These 
boundarv lines simply divide the virtual machine program 
into cache blocks of an equal size, as can be seen from the 
■^0 boundary line C that splits the virtual machine instruction 

jy »Br 8" into the opcode "BrS' and the operand "8". 

W Accordingly , when dividing a virtual machine program into 

;jp cache blocks, it is necessary to judge whether any of the 

virtual machine instructions that changes the value of the 
program counter 6901 will end up spanning a boundary between 
^ cache blocks. This increases the complexity of the 

^Ol processing and results in an actual decrease in the overall 

execution speed of a virtual machine when a cache is 
provided. 

20 would be conceivably possible to devise a method 

for storing an entire virtual machine program in cache 
memory or a method for arranging the virtual machine program 
in the cache based on analysis of the virtual machine 
program by a JIT compiler. However, the former of these 

25 methods uses cache memory inefficiently and has a further 

problem in that the time required for file transfer in a 
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network environment is greatly increased. The latter 
method, meanwhile, has a problem in that writing the virtual 
machine program into cache memory is very time-consuming. 
Accordingly, both of these methods result in a marked 
5 decrea°~ * n the overall execution efficiency of a virtual 
machine . 

SUMMARY OF THE INVENTION 

in view of the above problems, the present invention 
•^0 has an overall aim of providing a virtual machine that 

W executes a virtual machine program at a higher execution 

iffl speed than a conventional virtual machine, a virtual machine 

compiler that generates a program for this virtual machine 
^ (hereafter, a virtual machine and a virtual machine compiler 

■MB are together called a virtual machine system) , and a JIT 

O compiler. Here, a virtual machine compiler refers to a 

; yj progr:-: ■ :.._t translates a source program written in a high^ 

level language such as C into a virtual machine program. 

To achieve the above aim, the invention has the 
20 following six specific objects. 

The first object is to provide a virtual machine 
system that can diminish disadvantages caused by true data 
dependencies so that high execution speed is maintained. 

The second object is to provide a high-speed virtual 
25 machine system by minimizing the decreases in execution 

efficiency caused by interrupt handling. 
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The third object is to provide a virtual machine 
system which "native coding" for different real 

machines can be performed without decreasing overall 
execution speed, even when the virtual machine is used by 
5 real machines with different architectures. Such a virtual 

machine is highly independent of real machine architec Lures 
without decreasing execution speed. 

The fourth object is to provide a high-speed virtual 
machine system that can be used by a real machine with a 
yiO cache system without decreases in execution efficiency which 

jlU may result from a virtual machine instruction program being 

CO' divided into cache blocks or from complicated resolving 

addresses being performed when using a JIT compiler. 
*y fifth object is to provide a high-speed virtual 

U15 machine system that can decompress a compressed virtual 

machine program correctly even when the compressed program 
contains branch instructions, 

The sixth object is to provide a high-speed JIT 
compiler that does not need to perform a complex resolving 
20 of addresses . 

The first object can be achieved by the virtual 
machine of Claim 1. 

According to Claim 1, the virtual machine executes a 
virtual machine instruction sequence under control of a real 
25 machine, the virtual machine comprising: a stack unit for 

tempoiaj.'j.xy storing data in a last-in first-out format; an 
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instruction storing unit for storing the virtual machine 
instruction sequence and a plurality of sets of succeeding 
instruction information, wherein each virtual machine 
instruction in the virtual machine instruction sequence is 
associated with a set of succeeding instruction information 
that indicates a change in a storage s Late of the data in 
the stack unit due to execution of a virtual machine 
instruction executed after the associated virtual machine 
instruction; a read unit for reading a virtual machine 
instruction and an associated set of succeeding instruction 
information from the instruction storing unit; and a 
decoding-executing unit for specifying and executing 
operations corresponding to a combination of the read 
virtual machine instruction and the read set of succeeding 
instruction information. 

With the above construction, the instruction storing 
unit stores next instruction information in addition to 
virtual machine instructions and the decoding-executing unit 
performs not only operations for the decoded virtual machine 
instruction but also a stack handling in advance for a 
virtual machine instruction executed immediately after the 
decoded virtual machine instruction. Performing appropriate 
stack handling in advance in machine cycles where pipeline 
hazards (which occur especially' frequently in superscalar 
machines) would otherwise occur, enables the detrimental 
effects of true data dependencies to be absorbed and so 
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increases the execution speed of the virtual machine. 

Here, the decoding-executing unit may include: a 
real machine instruction sequence storing unit for storing a 
plurality of real machine instruction sequences that 
correspond to all combinations of virtual machine 
instructions and sets of succeeding instruction information; 
a specifying unit for specifying a real machine instruction 
sequence in the real machine instruction sequence storing 
unit, the real machine instruction sequence corresponding to 
a combination of the virtual machine instruction and the set 
of succeeding instruction information read by the read unit; 
and an executing unit for executing the specified real 
machine instruction sequence. 

In this way, advance stack handling for absorbing 
data dependencies can be included in the real machine 
instruction sequence corresponding to a virtual machine 
instruction. 

Here, each set of succeeding instruction information 
may indicate a change in a number of sets of data in the 
stack unit due to execution of a virtual machine instruction 
executed after a virtual machine instruction associated with 
the set of succeeding instruction information, and at least 
one real machine instruction sequence stored in the real 
machine instruction sequence storing unit may contain real 
machine instructions that perform a stack handling in the 
stack unit in advance for a virtual machine instruction that 
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is to be executed based on a set of succeeding instruction 
information associated with a currently executed virtual 
machine instruction . 

With this construction, when a change in a number of 
stack levels due to execution of a given instruction is 
canceled out by execution of an instruction executed 
immediately after the given instruction, needless stack 
handling can be avoided, which improves the execution speed 
of the virtual machine* 

Here, the real machine instruction sequences stored 
in the real machine instruction sequence storing unit may be 
composed with a premise that regions of the stack unit used 
to store two sets of data to be read first and second are 
mapped to two registers in the real machine . 

The above construction replaces the load and store 
stack operations that are frequently performed by stack-type 
virtual machines with read/write operations for the internal 
registers of the real machine. Such operations are suited 
for rearrangement as the advance stack handling performed in 
machine cycles where pipeline hazards would otherwise occur. 
In this way, execution efficiency of the virtual machine is 
raised. 

Here, the instruction storing unit may include a 
first storage area for storing the virtual machine 
instruction sequence and a second storage area for storing 
the sets of succeeding instruction information, wherein each 
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location that stores a virtual machine instruction in the 
first storage area may be associated with a location that 
stores an associated set of succeeding instruction 
information in the second storage area, and the read unit 
may read the virtual machine instruction from a location in 
the first storage area and the associated set of succeeding 
instruction information from a location in the second 
storage area, the location in the first storage area being 
associated with the location in the second storage area. 

In this way, a virtual machine instruction sequence 
and next instruction inf ozonation are stored separately, 
which means that a virtual machine instruction sequence of 
the present virtual machine has the same data format as a 
conventional virtual machine instruction sequence. 
Compatibility of instruction data format with a conventional 
virtual machine is therefore : maintained. 

Here, the virtual machine instruction sequence stored 
in the instruction storing unit may be an extended virtual 
machine instruction sequence that includes extended virtual 
machine instructions, the extended virtual machine 
instructions being combinations of virtual machine 
instructions and associated sets of succeeding instruction 
information, wherein the read unit may read an extended 
virtual machine instruction from the instruction storing 
unit, and wherein the decoding-executing unit may specify 
and execute operations corresponding to the extended virtual 
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machine instruction . 

In this way, since an extended virtual machine 
instruction is a combination of a virtual machine 
instruction and next instruction information, next 
instruction information need not be processed or stored 
separately. This means that a virtual machine with a 
similar architecture to a conventional computer can be 
provided. 

The fist object can be also achieved by the virtual 
machine compiler of Claim 7. According to Claim 7, the 
compiler generates programs /for a virtual machine with a 
stack architecture that includes a stack, the compiler 
including: an instruction sequence converting unit for 
converting a source program into a virtual machine 
instruction sequence executable by the virtual machine; a 
succeeding instruction information generating unit for 
generating sets of succeeding instruction information 
corresponding to virtual machine instructions in the virtual 
machine instruction sequence, each set of succeeding 
instruction information indicating a change in a storage 
state of data in the stack due to execution of a virtual 
machine ^struction executed immediately after a virtual 
machine instruction corresponding to the set of succeeding 
instruction information; and ari associating unit for 
associating each set of generated succeeding instruction 
information with a corresponding virtual machine instruction 
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and outputting the set of succeeding instruction information 
and the virtual machine instruction. 

In this way, the above virtual machine compiler 
generates not only virtual machine instructions but also 
5 next instruction information which can be used by a virtual 

machine to absorb true data dependencies. Thus, the present 
virtual r^chine compiler can generate programs for a virtual 
machine whose execution speed is improved by having data 
dependencies absorbed. 
y° The second object can be achieved by the virtual 

f\3 machine of Claim 8. According to Claim 8, the virtual 

jj¥j machine executes a virtual machine instruction sequence 

pS. under control of a real machine, the virtual machine 

•'^ including: an instruction storing unit for storing the 

1315 virtual machine instruction sequence; a read unit for 

a reading a virtual machine instruction in the virtual machine 

.|p instruction sequence from the instruction storing unit; and 

~ a decoding-executing unit for specifying and executing 

operatl-r? corresponding to the virtual machine instruction, 
wherein the decoding-executing unit includes a branch 
instruction judging unit for judging if the virtual machine 
instruction is a branch instruction and an interrupt 
handling unit for detecting, if the virtual machine 
instruction is judged to be a branch instruction, whether 
there is an interrupt request/ and, if so, performing a 
corresponding interrupt handling in addition to executing 
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the branch instruction. 

In this way, an interrupt handling is only performed 
whenever a branch instruction is executed, which is 
sufficf-r* 1 for most virtual machine programs. This 

5 suppresses decreases in execution speed caused by performing 

interrupt more frequently. 

Here/ the decoding-executing unit may further include 
a real machine instruction sequence storing unit for storing 
real machine instruction sequences corresponding to every 

0 virtual machine instruction and real machine instruction 

sequences for having interrupt handling performed 
corresponding to each interrupt request and an executing 
unit for executing a real machine instruction sequence 
corresponding to the virtual machine instruction read by the 

5 read unit, wherein if the virtual machine instruction is 

judged be the branch instruction and an interrupt request, 
is detected, the interrupt handling unit has the executing 
unit execute a real machine instruction sequence for having 
the corresponding interrupt handling performed and then the 

!0 real machine instruction sequence corresponding to the 

branch instruction. 

With this construction, * an interrupt handling to be 
additionally performed can be specified by a real machine 
instruction sequence. This realizes a virtual machine 

5 capable of performing an interrupt handling with a simpler 

architecture . 
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me second object can be also achieved by the virtual 
machine of Claim 10,. According to Claim 10, the virtual 
machine executes a virtual machine instruction sequence 
under control of a real machine, the virtual machine 
including: an instruction storing unit for storing the 
virtual machine instruction sequence; a read unit for 
reading a virtual machine instruction in the virtual machine 
instruction sequence from the instruction storing unit; and 
a decoding-executing unit for specifying and executing 
operations corresponding to the read virtual machine 
instruction, wherein the dfecoding-executing unit includes a 
block judging unit for judging if the read virtual machine 
instruction is a virtual machine instruction representative 
of a block, a block being a predetermined number of virtual 
machine instructions and an interrupt handling unit for 
detecting, if the read virtual machine instruction is judged 
to be the representative virtual machine instruction, 
whether there is an interrupt request to the virtual 
machine, and if so, performing a corresponding interrupt 
handling in addition to executing the representative virtual 
machine instruction. 

In this way, an interrupt handling is performed every 
time a predetermined number of virtual machine instructions 
are executed, and a frequency to perform interrupt handling 
can be controlled by changing this number in advance. This 
avoids decreases in execution speed caused by performing 
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interrupt handling more frequently . 

Here, the decoding-executing unit may include a real 
machine instruction sequence storing unit for storing a 
plurality of real machine instruction sequences 
corresponding to every virtual machine instruction and at 
least one real machine instruction sequence for having 
interrupt handling performed in response to an interrupt 
request and an executing unit for executing a real machine 
instruction sequence corresponding to the read virtual 
machine instruction, wherein the block judging unit may 
judge that the read virtual. machine instruction is a virtual 
machine instruction representative of the block when a 
number of virtual machine instructions that have been read 
is equal to a multiple of the predetermined number and 
wherein if the read virtual machine instruction is judged to 
be a representative virtual machine instruction and an 
interrupt request has been detected, the interrupt handling 
unit may have the executing unit execute a real machine 
instruction sequence for having the interrupt handling 
performed and then the real machine instruction sequence 
corresponding to the representative virtual machine 
instruction . 

With this construction, an interrupt handling to be 
additionally performed can be specified by a real machine 
instruction sequence. As a result, a virtual machine that 
is capable of performing an interrupt handling with a 
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simpler architecture can be achieved. 

The third object may be achieved by the virtual 
machine of Claim 12. According to Claim 12 , the virtual 
machine executes a virtual machine instruction sequence 
under control of a real machine, the virtual machine 
including: a real machine program storing unit for storing a 
plurality of subprograms composed of real machine 
instructions; an instruction storing unit that includes a 
first area for storing the virtual machine instruction 
sequence and a second area for storing a plurality of 
pointers to the subprogram's in the real machine program 
storing unit; a read unit for reading a virtual machine 
instruction in the virtual machine instruction sequence from 
the first area in the instruction storing unit; and a 
decoding-executing unit for specifying and executing 
operations corresponding to the read virtual machine 
instruction, wherein the decoding-executing unit includes an 
area judging unit for judging whether the virtual machine 
instruction is an instruction that transfers control flow to 
a location in the second area and an address converting- 
executing unit for executing, if the virtual machine 
instruction is judged to be an instruction that transfers 
control flow to a location in the second area, a subprogram 
indicated by a pointer stored in the location. 

With this construction, execution of either a virtual 
machine function or a real machine function is solely 
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determined by a corresponding location in an area of the 
memory map in the virtual machine, so a setting of whether a 
virtual machine function or a real machine function is 
executed for a function can be easily changed. This makes 
it possible to use "native-coding" in virtual machine 
programs for real machines with different architectures* 

Here, the first area and the second area in the 
instruction storing unit may be two adjacent storage areas 
whose boundary is marked by an address, and the area judging 
unit may judge, when the read virtual machine instruction is 
a call instruction for a subprogram, whether the virtual 
machine instruction is an instruction that transfers control 
flow, by comparing a call address of the call instruction 
with the address. 

ITith this construction, control over switches between 
executing a virtual machine function and a real machine 
function can be easily achieved by shifting the boundary 
line between areas in the memory map of the virtual machine. 
As a result, virtual machines that have improved execution 
speed and are suited to different real machine environments 
can be realized. 

The fourth object can be achieved by the virtual 
machine of Claim 14. According to Claim 14, the virtual 
machine executes a virtual machine instruction sequence 
under control of a real machine, the virtual machine 
including: an instruction storing unit for storing the 
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virtual machine instruction sequence; a read unit for 
reading a virtual machine instruction in the virtual machine 
instruction sequence from the instruction storing unit; and 
a decoding-executing unit for specifying and executing 
operations corresponding to the read virtual machine 
instruction, wherein the instruction storing unit is a 
plurality of instruction blocks that constitute the virtual 
machine instruction sequence, the instruction blocks 
corresponding to basic blocks, wherein the instruction 
blocks each include: an identifier area for storing an 
identifier that specifies I start position of the 
instruction block in the instruction storing unit; a non- 
branch instruction area for storing non-branch instructions 
belonging to a corresponding basic block; and a branch 
instruction area for storing at least one branch instruction 
belonging to the corresponding basic block, wherein each 
branch instruction stored in the branch instruction area 
designates a branch destination using an identifier stored 
in one of the identifier areas, and wherein if the read 
virtual machine instruction is a branch instruction, the 
decoding-executing unit has control flow branch to a start 
position of a non-branch instruction area in an instruction 
block having an identifier designated by the branch 
instruction as a branch destination. 

•With this construction, there is always only one 
entry point for each instruction block, which is the start 
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of the instruction block* as a result, the address analysis 
for branch destinations of branch instructions is 
simplified, and the timing taken by compiling is reduced. 
Also, by caching instructions in instruction block units, 
the judgment processing regarding the cache boundaries is 
simplified, and decreases in execution efficiency that occur 
when a cache is provided for the virtual machine can be made 
smaller than in conventional techniques . 

Here, the decoding-executing unit may include a 
program counter composed of (a) an identifier register for 
storing an identifier of ah instruction block to which a 
virtual machine instruction to be read belongs and (b) an 
offset counter for storing an offset that indicates a 
relative storage position of the virtual machine instruction 
in the instruction block, wherein the read unit may read the 
virtual machine instruction based on the identifier and the 
offset in the program counter, wherein the decoding- 
executing unit may update, if the read virtual machine 
instruction is the branch instruction, the program counter 
by writing the identifier designated as the branch 
destine*; -rt by the branch instruction into the identifier 
register and by setting an initial value in the offset 
counter, and if the read virtual machine instruction is a 
non-branch instruction, update "the program counter by 
incrementing the offset counter, and the read unit may read 
a virtual machine instruction to be executed next based on 
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the program counter updated by the decoding-executing unit. 

Accordingly, each instruction block is specified only 
by a value of the identifier segment register, and each 
relative instruction storage position of a virtual machine 
5 instruction by a value of the offset counter. As a result, 

an address converting technique according to a conventional 
"segment method" can be used. 

Here, the decoding-executing unit may include a real 
machine instruction sequence storing unit that stores a 
plurality of real machine instruction sequences that each 
correspond to a different virtual machine instruction, the 
W instruction blocks in the instruction storing unit each may 

:|p include a decoded data sequence area for storing a decoded 

^ data sequence that specifies real machine instruction 

■'gs sequences in the real machine instruction sequence storing 

unit, the real machine instruction sequences corresponding 
r; 0* to virtual machine instructions stored in the non-branch 

instruct ?n area and the branch instruction area of the 
instruction block, wherein if a decoded data sequence is 
stored in an instruction block where reading is to be 
performed, the read unit may read a set of decoded data in 
the decoded data sequence instead of a virtual machine 
instruction, and if not, the read unit may read the virtual 
machine instruction and then generate a set of decoded data 
to specify a real machine instruction sequence in the real 
machine instruction sequence storing unit that corresponds 
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to the virtual machine instruction, and wherein the 
decoding-executing unit may read from the real machine 
instruction sequence storing unit the real machine 
instruction sequence specified by the set of decoded data 
that has been either read or generated by the read unit, and 
executes the real machine instruction sequence* 

With this construction of the virtual machine, in 
addition to the effects achieved in the virtual machine of 
Claim 15 that manages a virtual machine program in units of 
instruction blocks, a time to decode a virtual machine 
instruction is shortened fbr : the instruction blocks that 
already have a decoded data sequence. This is because the 
decoded data sequence is executed directly instead of 
virtual machine instructions. As a result, the execution 
speed c' the virtual machine is improved . 

Here, the decoded data sequence area in the 
instruction storing unit may include a flag area for storing 
a flag that indicates whether the decoded data sequence is 
stored in the decoded data sequence area, wherein the 
decoding-executing unit may include a current flag storing 
unit for storing a flag that is read from a flag area in a 
branch destination instruction block by the decoding- 
executing unit when executing a branch instruction, and 
wherein the read unit may read a set of decoded data or a 
virtual machine instruction depending on the flag in the 
current flag storing unit. 
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For this construction, a flag indicating whether a 
decoded data sequence exists is provided to each instruction 
block and read from the instruction block to be held by the 
virtual machine. As a result, when executing virtual 
machine instructions in an instruction block that has a 
decoded data sequence, the virtual machine need not refer to 
a flag every time it executes one virtual machine 
instruction . 

Here, each instruction block in the instruction 
storing unit may further include a flag area for storing a 
flag that indicates whetheV a decoded data sequence is 
stored 1'.. the decoded data sequence area of the instruction 
block, and the decoding-executing unit may include a decoded 
data sequence writing unit for judging, after a branch 
instruction has been executed, whether the instruction block 
designated as the branch destination by the branch 
instruction stores a decoded data sequence by referring to a 
flag stored in a flag area of the instruction block, and if 
no decoded data sequence is stored, having a virtual machine 
instruction sequence in the instruction block read, decoding 
the read virtual machine instruction sequence to produce a 
decoded data sequence, and writing the decoded data sequence 
into a decoded data sequence area in the instruction block. 

ior rhis construction, a decoded data sequence is 
generated when an instruction block is executed for the 
first time. As a result, when the same instruction block 
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needs to be repeatedly executed as in loop processing, the 
time required for executing instructions corresponding to 
the block is reduced from the second execution of the block 
onwards . 

5 The fifth object can be achieved by the virtual 

machine of Claim 19. According to Claim 19, the virtual 
machine executes a virtual machine instruction sequence 
under control of a real machine, the virtual machine 
including: an instruction storing unit for storing a 
:1 Jo compressed virtual machine instruction sequence to be 

W executed; a read unit for xeading a compressed virtual 

Oj machine instruction in the compressed virtual machine 

IP instruction sequence from the instruction storing unit and 

decompressing the compressed virtual machine instruction to 
■ji5 generate a decompressed virtual machine instruction; and a 

O decoding-executing unit for specifying and executing 

/ijS operations corresponding to the decompressed virtual machine 

instruction, wherein the instruction storing unit is a 
plurality of instruction blocks containing compressed 
20 virtual machine instructions constituting the compressed 

virtual machine instruction sequence, the instruction blocks 
corresponding to basic blocks, wherein the instruction 
blocks each include: an identifier area for storing an 
identifier that specifies a start position of the 
25 instruction block in the instruction storing unit; a non- 

branch instruction area for storing compressed non-branch 
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instructions belonging to a corresponding basic block; and a 
branch instruction area for storing at least one compressed 
branch instruction belonging to the corresponding basic 
block, wherein each compressed branch instruction stored in 
a branch instruction area designates a branch destination 
using an' identifier stored in one of the identifier areas, 
and wherein if the decompressed virtual machine instruction 
is a branch instruction, the decoding-executing unit has 
control flow branch to a start position of a non-branch 
instruction area in an instruction block having an 
identifier designated by the branch instruction as a branch 
destination. 

For this construction, the compressed virtual machine 
program is stored in units of the instruction blocks based 
on basic blocks and is decompressed by the decoding- 
executing unit. As a result/ malfunctions caused when 
compressed bit sequences are mistakenly decoded starting 
midway through do not occur to this virtual machine* 

Here, each instruction block may include a 
decompression table area for storing a decompression table 
for use during decompression of compressed virtual machine 
instructions in the instruction, block, the decompression 
table containing at least one combination of a compressed 
virtual machine instruction stored in the instruction block 
and a corresponding decompressed virtual machine instruction 
and wherein the read unit may read the compressed virtual 
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machine instruction from the instruction storing unit and 
decompresses the compressed virtual machine instruction by 
referring to a decompression tabic in an instruction block 
to which the compressed virtual machine instruction belongs 
to generate the decompressed virtual machine instruction. 

With this virtual machine, each instruction block 
stores a decompression table, and a different decompression 
table is referred for execution of instructions belonging to 
each instruction block. Accordingly, the present virtual 
machine assures that even when each instruction block is 
compressed in a different ^format, decompression can be 
correctly performed. 

The sixth object can be achieved by the JIT compilers 
of Claims 25 and 26. According to Claim 25, the jit 
compiler is for use with a virtual machine that executes a 
virtual machine instruction sequence under control of a real 
machine/ the JIT compiler converting parts of the virtual 
machine instruction sequence into real machine instruction 
sequences before execution, the JIT compiler including: a 
block start information receiving unit for receiving an 
input of block start information for each virtual machine 
instruction that composes the virtual machine instruction 
sequence, the block start information showing whether a 
corresponding virtual machine instruction would correspond 
to a start of a basic block if the virtual machine 
instruction sequence were divided into basic blocks; a 
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converting unit for converting virtual machine instructions 
in the virtual machine instruction sequence into real 
machine instruction sequences; and an outputting unit for 
rearranging the real machine instruction sequences produced 
5 by the converting unit into basic block units in accordance 

with the block start information received by the block start 
information receiving unit. Here, this JIT compiler may 
further include a branch violation judging unit for judging, 
when a real machine instruction at a start of a produced 

3° real machine instruction sequence corresponds to a virtual 

machine instruction whose i>lbck start information indicates 
that the virtual machine instruction would be a start of a 

IP basic block, whether the real machine instruction is going 

to be arranged in an address that violates an address 
pl5 alignment restriction of the real machine, wherein if the 

; : :rf' real machine instruction is going to be arranged in an 

-ff address that violates the address alignment restriction, the 

outputting unit may rearrange the real machine instruction 
sequence so that the real machine instruction is not 
20 arranged in the address. 

Accordingly, without performing the complicated 
processing for analyzing branch, destinations of branch 
instructions, the present JIT compiler can produce a real 
machine instruction program at a higher speed in which 
25 branch destinations are arranged at addresses complying with 

a two-word alignment. 
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Here, the outputting unit may insert a certain number 
of no-operation instructions at a start of each basic block, 
the number being a number of real machine instructions 
processed during a delay of a delayed branch. 

As a result, the above JIT compiler is capable of 
dealing with delayed branch by inserting no-operation 
instructions at a start of each basic block without 
performing a complicated delayed branch analyzing. 

As has been described, the present invention improves 
execution speed of virtual machines and is especially 
valuable as a technique toS promote efficient and high-speed 
use of shared resources by different types of computers 
connected' on a network environment* 

BRIEF DESCRIPTION OF THE DRAWINGS 

These and other objects, advantages and features of 

the invention will become apparent from the following 

description thereof taken in conjunction with the - 

accompanying drawings which illustrate a specific embodiment 

of the invention. In the drawings; 

Fig. 1 is a block diagram showing a conventional 

virtual machine with a stack architecture; 

Fig. 2 is an explanation drawing that shows a virtual 

machine instruction set used in the conventional technique 

and the present inventions- 
Fig. 3 shows contents of the decode table shown in 
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Fig. 1; 

Fig, 4 shows microprogram lists stored in the 
microprogram storing unit shown in Fig. 1; 

Fig. 5 shows operation contents of real machine 
instructions of the conventional technique and the present 
invention; 

Fig. 6 is a flowchart showing the processing of the 
decoding unit shown in Fig. 1; 

Fig. 7 is a flowchart showing the specific processing 
of step 4506 in Fig. 6; 

Fig. 8 is a flowchart showing the processing of 
decoding unit 4402 in a case where decoded data transmitted 
from the decoding unit is transferred to the executing unit 
via a buffer; 

Fig. 9 is a flowchart showing the processing of the 
executing unit shown in Fig.:l; 

Fig. 10A shows a sample program list; 

Fig. 10B shows the arithmetic expression "2* (3+4)" 
based on Fig. 10A; 

Fig. 10C shows decoded data transmitted from the 
decoding unit in order; 

Fig. 11 shows changing internal states of the 
conventional virtual machine when the executing unit of the 
virtual machine processes the decoded data shown in Fig. 
IOC; 

Figs. 12A-12D show a microprogram list for the 
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conventional virtual machine that uses the TOS variable ; 

Fig. 13 shows changing internal states of the 
conventional virtual machine that stores microprograms shown 
in Figs. 12A-12D when the virtual machine executes the 
5 virtual machine program shown in Fig, 10A; 

Fig. 14 is an explanation drawing that shows 
abbreviated symbols for pipeline stages; 

Fig. 15 shows an ideal pipeline flow of the standard 

machine ; 

Fig. 16 shows an ideal pipeline flow of the 
■W superscalar machine; V 

ffiB Fig. 17 shows a pipeline flow of the standard machine 

when hazzards occur; 
J" Fig. 18 shows a pipeline flow of the superscalar 

M[5 machine when hazzards occur; 

2 Fig. 19 shows a pipeline flow when two clock cycles 

vip- need to pass before values obtained through memory access 

can be used in the case shown in Fig. 17; 

Fig. 20 shows a pipeline flow when two clock cycles 
20 need to pass before values obtained through memory access 

can be used in the case shown in Fig. 18; 

Fig. 21 shows a pipeline flow for the standard 
machine when instructions Al and A2 are instructions that 
indicate jump destinations using a register; 
25 Fig. 22 shows a pipeline flow for the superscalar 

machine when instructions Al and A2 are instructions that 



49 



indicate a jump destination using a register; 

Fig. 23 shows a pipeline flow when the virtual 
machine of the first conventional technique is realized by a 
standard machine where one clock cycle needs to pass before 
values obtained through memory access can be used and the 
virtual machine program shown in Fig. 10A is executed; 

Fig. 24 shows a pipeline flow corresponding to Fig. 
23 when the virtual machine of the first conventional 
technique is realized by a superscalar machine; 

Fig. 25 shows a pipeline flow for the standard 
machine when two clock cycies need to pass before values 
obtaiij.cvJ u-ixrough memory access can be used; 

Fig. 2 6 shows a pipeline flow corresponding to Fig. 
25 in the case of the superscalar machine; 

Fig, 27 shows a virtual machine program list as a 

sample; 

Fig. 28 is a flowchart for the sample program list 
shown in Fig. 27; 

Fig. 2 9 is a conversion table that is used by the 
conventional JIT compiler; 

Fig. 30 shows the code arrangement of the real 
machine program that is obtained when the sample virtual 
machine program shown in Fig. 27 is compiled using the 
conversion table shown in Fig. 29; 

Fig. 31A shows an example of a compression table ; 

Fig. 31B shows an example code that is obtained using 
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the compression table shown in Fig. 31A; 

Fig. 32 is a drawing for explaining a problem likely 
to occur to the conventional virtual machine that includes a 
cache memory; 

Fig. 33 shows the case where the sample virtual 
machine program shown in Fig. 27 is stored in the cache 
memory , with the boundary lines A, B, and C marking the 
boundaries between the cache blocks ; 

•ily. 34 is a hardware construction drawing of a 
computer system where the virtual machine systems of the 
first to ninth embodiment s5 are used; 

Fig. 35 is a block diagram showing the construction 
of the virtual machine in the first embodiment; 

Fig. 3 6A shows the next instruction information 
stored in the next instruction information storing unit of 
the virtual machine shown in: Fig. 35; 

Fig. 36B shows the virtual machine program that is 
stored in the instruction storing unit and that corresponds 
to the next instruction information shown in Fig. 36A; 

Fig. 37 shows stored contents of the decode table of 
the firsL embodiment; 

Figs. 38A and 38B show microprograms corresponding to 
virtual machine instructions "Push" assigned "U" and "D M , 
respectively; 

Figs. 39A and 39B show microprograms corresponding to 
virtual machine instructions "Add" assigned "U" and "D", 
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respectively; 

Figs. 4 OA and 4 OB show microprograms corresponding to 
virtual machine instructions "Mult" assigned "U" and "D", 
respectively ; 

Fig. 41A shows a microprogram corresponding to the 
latter f of the microprograms assigned "U" shown in Figs, 
39A and 4 OA; 

Fig* 41B shows a microprogram corresponding to the 
latter half of the microprograms assigned "D" shown in Figs. 
39B and 4 OB; 

Fig. 42 is a state- transition diagram showing changes 
in virtual machine instruction types to be executed by the 
virtual machine of the first embodiment ; 

Fig. 43 is a flowchart showing the processing of the 
decoding unit of the virtual machine of the first 
embodiment; 

Fig. 44 is a flowchart showing the initial half of 
the detailed processing of step 4907 for table searching in 
Fig. 43; 

Fig. 45 is a flowchart showing the latter half of the 
detailed processing of step 4907 for table searching in Fig. 
43; 

Fig. 4 6 shows a decoded data sequence successively 
outputted from the decoding unit to the executing unit of 
the virtual machine in the first embodiment; 

Figs. 47A and 47B show changes in the internal states 
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of the virtual machine when its executing unit operates 
according to the decoded data sequence shown in Fig. 4 6; 

Fig. 48 shows a pipeline flow for the standard real 
machine when one clock cycle needs to pass before values 
5 obtained through memory access can be used; 

Fig, 49 shows a pipeline flow for the superscalar 
real machine when one clock cycle needs to pass before 
values obtained through memory access can be used; 

Fig. 50 shows a pipeline flow for the standard real 
yiO machine when two clock cycles need to pass before values 

pj obtained through memory access can be used; 

iffl Fig. 51 shows a pipeline flow for the superscalar 

;ft real machine when two clock cycles need to pass before 

values obtained through memory access can be used; 
H ^ 15 t Fic ?- 52 is a block diagram showing the construction 

O of the virtual machine compiler in the first embodiment; 

: : p Fig. 53 shows the data construction of the source 

^ program to be inputted into the instruction sequence 

converting unit of the virtual machine compiler; 
20 Fi 9- 54 shows the data construction of each node 

shown in Fig. 53; 

Fig. 55 is a flowchart showing a general procedure of 
the instruction sequence converting unit of the virtual 
machine compiler; 
25 Fig. 56 is a flowchart' showing the detailed 

processing of step 5405 in Fig. 55; 



irig. 57 is a flowchart showing the detailed 
processing of step 5613 in Fig, 56; 

Fig. 58 is a flowchart showing the processing of the 
next instruction information generating unit of the virtual 
machine compiler; 

Fig. 59 is a flowchart showing the processing of the 
relation associating unit of the virtual machine compiler; 

Fig. 60 is a block diagram showing the construction 
of the virtual machine in the second embodiment; 

Fig. 61 is a flowchart showing the detailed 
processing for table sear ch and decoded data output by the 
decoding unit of the virtual machine; 

Fig. 62 is a flowchart showing the processing of the 
branch instruction detecting unit of the virtual machine; 

Fig. 63 is a flowchart showing the processing of the 
instruction inserting unit of the virtual machine; 

Fig. 64 is a block diagram showing the construction 
of the virtual machine in the third embodiment; 

Fig. 65 is a flowchart showing the processing of the 
block converting unit of the virtual machine ; 

Fig. 66 is a block diagram showing the construction 
of the virtual machine of the fourth embodiment; 

Fig. 67 shows a memory map of the instruction storing 
unit of the virtual machine; 

Fig. 68 shows the construction of the real machine 
function table shown in Fig. 67; 
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Fig. 69 is a flowchart showing the processing of the 
execution unit of the virtual machine; 

Fig. 7 0 shows a modification example of a memory map 
of the instruction storing unit of the virtual machine; 

Fig. 71 is a block diagram showing the construction 
of the virtual machine in the fifth embodiment; 

Fig. 72 shows an example of states of virtual machine 
programs stored in the instruction storing unit of the 
virtual machine ; 

rig. 73 shows a control flow of the virtual machine 
programs shown in Fig. 12 A 

Fig. 74 shows a data format obtained by the 
addressing by the PC of the virtual machine; 

Fig, 75 is a flowchart showing the processing of the 
branch destination converting unit of the executing unit of 
the virtual machine; 

Fig. 76 shows the address conversion by the branch 
destination converting unit, where logical addresses and 
identifiers in the virtual machine program shown in Fig. 72 
are replaced with physical addresses; 

Fig. 77 is a block diagram showing the virtual 
machine compiler in the fifth embodiment; 

Fig. 78 shows the construction of the branch address 
conversion table of the virtual machine compiler; 

Fig. 79 is a flowchart showing the processing of the 
block converting unit of the virtual machine compiler; 
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Fig. 80 is a flowchart showing the detailed 
processing of step 7607 in Fig. 79; 

Fig. 81 is a flowchart showing the detailed 
processing of step 7704 in Fig. 79; 
5 Fig. 82 is a flowchart showing the detailed 

processing of step 7609 in Fig. 79; 

£ig. 83 shows the relationship between the PC, the 
instruction block storing areas, and the cache table when 
caching is performed by the virtual machine in instruction 
-10 block units; 

V Fig. 84 is a flowchart showing the instruction 

ffi processing of branch instructions by the executing unit when 

jf instructions are cached in instruction block units in the 

~" virtual machine; 

Fig. 85 is a block diagram showing the construction 
3 of the virtual machine in the sixth embodiment; 

p Figs. 86A to 86C show examples of the stored state of 

~ virtual machine programs in the instruction storing unit; 

Fig. 87 is a flowchart showing the processing of the 
20 decoding unit of the virtual machine; 

Fig. 8 8 is a flowchart showing the processing of the 
executing unit of the virtual machine; 

Fig. 89 is a flowchart showing the control performed 
for the decoding unit when the executing unit of the virtual 
25 machine executes a branch instruction; 

Fig. 90 is a block diagram showing the construction 
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of the virtual machine in the seventh embodiment; 

Fig. 91 is a flowchart showing the processing of the 
decoded instruction sequence writing unit, the current flag 
read control unit, and the branch destination converting 
unit when the virtual machine executes a branch instruction; 

Fig. 92 is a flowchart showing the detailed 
processing of step 9110 in Fig. 91; 

Fig. 93 is a flowchart showing the operation of the 
decoding unit when viewed from the executing unit; 

Fig. 94 is a block diagram showing the construction 
of the virtual machine in 'the eighth embodiment; 

Fig. 95A shows an example of the decompression table 
stored in the restoring information storing unit of the 
virtual machine; 

Fig, 95B shows the rules governing codes in the 
decompression table shown in^ Fig. 95A; 

Figs- 96A to 96C show examples of the stored states 
of a virtual machine program that is stored in the - 
instruction storing unit of the virtual machine; 

Fig, 97 is a flowchart showing the processing of the 
decoding unit of the virtual machine; 

Fig. 98 is a flowchart showing the detailed 
processing of step 9602 in Fig. 97; 

Fig. 99 is a block diagram showing the construction 
of the entire compiler system including the JIT compiler of 
the ninth embodiment; 
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i'ig. 100 is a flowchart showing the processing of the 
block start information generating unit of the virtual 
machine compiler; 

Fig, 101 is a flowchart showing the processing of the 
real machine instruction converting unit, the branch 
position amending unit, and the real machine address storing 
unit; 

Fig. 102 is a table showing the block start 
information generated by the block start information 
generating unit, the timing of the generation of "Nop" real 
machine instructions generated by the branch position 
amending unit of the JIT compiler, and other related 
information; and, 

Fig. 103 shows a modification example of a virtual 
machine instruction format used by the virtual machine of 
the present invention, 

DESCRIPTION OF THE PREFERRED EMBODIMENT (S) 

The following explains embodiments of the present 
invention, with reference to figures. 

First Embodiment 

The following describes the virtual machine system of 
the first embodiment that can absorb a true data dependency. 

Fig. 34 shows a hardware construction of the computer 
system 200 that operates the virtual machine system of the 
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present embodiment. The computer system 200 comprises a 
real machine 201, a memory 200, a keyboard 203, a mouse 204, 
a display screen 206 # a hard disks 207, a network card 208 , 
and internal busses 205A-205C that connect these elements. 
This hardware construction is the same as that of a normal 
personal computer. 

The virtual machine and the virtual machine compiler 
of the present embodiment are programs written with 
instructions for the real machine 201. These programs are 
stored in the hard disks 207 and loaded into the memory 202 
according to instructions ^from the user or from another 
program that is being executed by the real machine 201. The 
real machine 201 is a CPU that decodes and executes the real 
machine instructions shown in Fig, 5 in the same way as 
described in the prior art. 

Virtual Machine Construction 

Fig. 35 is a block diagram showing the construction 
of a virtual machine 100 of the present embodiment. This 
figure corresponds to Fig. l in the explanation of the prior 
art. This virtual machine 100 includes a next instruction 
information storing unit 101, ah instruction storing unit 
102, a decoding unit 103, an executing unit 110 and a stack 
120. 

The instruction storing unit 102 is a storage area to 
Store a virtual machine program to be processed, and the 
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next instruction information storage unit 101 is an area to 
store sets of next instruction information that correspond 
to virtual machine instructions constituting the virtual 
machine program. A set of next instruction information 
refers to one-bit information indicating whether a virtual 
machine instruction that immediately follows a currently 
executed instruction is an instruction whose execution 
results in the level of the stack 120 being increased or 
decreased. Next instruction information indicating the 
former is written as "U" and the latter as "D". This 
information is generated together with the virtual machine 
program from a source program using a virtual machine 
compiler of the present embodiment, which will be described 
later. 

Figs. 36A and 36B respectively show examples of next 
instruction information stored in the next instruction 
information storing unit 101 and virtual machine codes 
stored in the instruction storing unit 102. These virtual 
machine codes and next instruction information correspond to 
a virtual machine program with the same contents as the 
virtual machine program shown in Fig. 10A, i.e. a 
calculation of "2* (3+4)". For example, next instruction 
information "U" is stored in locations specified by 
addresses "1" and "2" in the next instruction storing unit 
101, since the corresponding virtual machine instruction 
"Push 2" in addresses "1" and "2" in the instruction storing 
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unit 102 precedes an instruction "Push 3", that raises the 
level of the stack 120. 

The decoding unit 103 reads virtual machine 
instructions successively from the instruction storing unit 
102, decodes the virtual machine instruction referring to a 
corresponding set of next instruction information stored in 
the instruction storing unit 102, and outputs the result of 
the decoding to the executing unit 110. The decoding unit 
103 includes a next instruction information reading unit 
104, an instruction reading unit 105, a search unit 106, a 
program counter (PC) 107, ^arid a decode table 108. 

The PC 107 is a storage area to hold the address of a 
virtual machine instruction to be read next from the 
instruction storing unit 102 and the address of the 
corresponding next instruction information in the next 
instruction information storing unit 101. In the present 
embodiment, these addresses are assigned the same address 
number and are updated by the executing unit 110. The PC 
107 is allocated physically to register #2 (r2) of the real 
machine 201. 

The instruction reading unit 105 reads a virtual 
machine instruction from the instruction storing unit 102 
according to the address indicated by the PC 107 and outputs 
the read virtual machine instruction to the search unit 106. 
In the same way, the next instruction information reading 
unit 104 reads a set of next instruction information from 
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the next instruction information storing unit 101 specified 
by the address in the PC 107 and outputs the read 
information to the search unit 106. This processing by the 
next instruction unit 104 is synchronized with the 
instruction reading unit 105. 

The decode table 108 stores the combinations of the 
next instruction information and opcodes corresponding to 
all the virtual machine instructions shown in Fig. 2 to be 
decoded and executed by the virtual machine 100, a jump 
address of a microprogram in the microprogram storing unit 
111 to which each combination jumps , and a number of 
operand* -t-hat accompany each opcode. Each opcode has one 
combination with the next instruction information "U", and 
one with the next instruction "D" . As in the prior art, 
opcodes are 1-byte long, and operands are counted in units 
of one byte. 

Fig. 37 shows the stored contents of the decode table 
108, which corresponds to the decode table 4406 shown in 
Fig. 3 in the description of the prior art. Unlike the 
conventional decode table 4406, the jump address 108C and 
the number of operands 108D in this decode table 108 
correspond to two cases when the opcode 108A is associated 
with next instruction information 108B "U" and "D" . As one 
exampl-.- £ :r the opcode "Push", a jump address to a 
microprogram that processes "Puali" assigned "u" is provided 
for cases when the opcode "Push" is associated with the next 
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instruction information "U", and a jump address to a 
microprogram that processes "Push" assigned "D" is provided 
for cases when the opcode "Push" is associated with the next 
instruction information "D" . 

The search unit 106 receives an opcode of a virtual 
machine instruction from the instruction reading unit 105 
and the next instruction information from the next 
instruction reading unit 104 as a combination, specifies an 
entry corresponding to the combination out of the decode 
table 108, reads a jump address stored in the specified 
entry to output it as the decoded data to the executing unit 
101. 

The executing unit 110 executes a microprogram 
corresponding to a virtual machine instruction using the 
decoded data sent from the search unit 106. This executing 
unit 110 includes a microprogram storing unit 111 and a 
stack pointer (SP) 112. 

The microprogram storing unit 111 stores 
microprograms corresponding to the combinations of the 
virtual machine instructions to be decoded and executed by 
the virtnal machine 100 and the next instruction 
information. These microprograms will be explained later in 
detail . 

The SP 112 is a storage area to store an address of 
the top of the stack 120 as described in the prior art, and 
is allocated physically to a register #3 (r3) of the real 
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machine 201. 

The stack 120 is a temporary LIFO storage area used 
by the executing unit 110 to execute microprograms for the 
decoded virtual machine program. This stack 120 includes 
the TOS variable 121, the SOS (Second Of Stack) 122 and the 
memory stack 123. The TOS variable 121 is a storage area 
for a value at the top of the stack 120 and is physically 
allocated to register #0 (rO) of the real machine 201. The 
SOS variable 122 is a storage area for a value on the second 
level of the stack 120 and is physically allocated to 
register #4 <r4) of the real machine 201. The memory stack 
123 is a storage area for values on the third and lower 
levels and is allocated physically to the memory 202. 

Contents of the Microprogram Storing Unit 111 

Figs. 38A and 38B respectively show microprograms in 
the microprogram storing unit m that correspond to the 
virtual 7"=?chine instructions "Push" assigned "U" and "Push" 
assigned "D". Figs. 39A, 39B, 40A, and 40B show 
microprograms corresponding to virtual machine instructions 
"Push" assigned "U" and "D", and virtual machine 
instructions "Mult" assigned "U" and "D". The instruction 
sequence shown in Fig. 41A forms the common latter part of 
the microprograms shown in Figs. 39A and 40A that correspond 
to virtual machine instruction's assigned "U". In the same 
way, the instruction sequence shown in Fig. 4 IB forms the 
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common latter part of the microprograms shown in Figs. 39B 
and 4 OB that correspond to virtual machine instructions 
assigned "D". The operation content of each real machine 
instruction in these microprograms are shown in Fig. 5. 
5 By comparing these microprograms with the 

conventional microprograms shown in Figs. 4A-4D and 12A-12D, 
it can be seen that the microprograms in the microprogram 
storing unit 111 of the virtual machine 100 in the present 
embodiment have the following characteristic. That is, with 
■^10 the present embodiment, different microprograms are prepared 

f or a same type of virtual^ machine instruction and are 
jip selectively used depending on the next instruction 

:|p information assigned to the virtual machine instruction. By 

'J" considering how stack handling will be performed during the 

Jl5 executor of the next virtual machine instruction, needless 

stack operations and pipeline disturbances due to true data 
# dependency can be avoided. For instance, while the 

microprogram shown in Fig. 38B is for the virtual machine 
instruction "Push", it does not include an instruction to 
20 push a value stored in the SOS variable 122 to the memory 

stack 123 because the next instruction information assigned 
to this virtual machine instruction is "D", meaning that the 
execution of the next instruction will result in a pop. In 
this way, needless pushes to the memory are avoided in 
25 advance . 

The virtual machine 100 has also another 
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characteristic in that not only the storage area, at the top 
of the stack 120 (the TOS variable 121), but also the 
storage area on the second level of the stack 120 (the SOS 
variable 122) are allocated to registers, not the memory. 
5 By doing so, both values used by an operation such as an 
addition can be held in registers, so that data transfer 
between the real machine 201 and the memory 202 can be 
performed less frequently. For instance, when an addition 
is performed, no data transfer between the registers and the 
O10 memory 202 is necessary. 

SHU Fig. 42 is a stated transition diagram showing changes 

iffi in virtual machine instruction types. Here, each state in 

the state transition corresponds to an instruction type for 
each virtual machine instruction to be executed by the 
GU5 virtual machine 100 of the present embodiment. These 

>S instruction types are obtained by classifying all the 

^ combinations of virtual machine instructions to be decoded 

and executed by the virtual machine 100 and next instruction 
information, into the groups or instruction types, as 
20 indicated in the ovals in the figure, according to 

operations performed in the stack 120. Three numbers 
"X,Y(Z)" enclosed by each circle respectively denote a 
number of values used out of the stack by an operation, an 
increa*-; the number of stack levels due to the execution 
25 of the operation, and the next' instruction information. For 

example, the "2,-l(U) ,f instruction type represents all 
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virtual machine instructions that require two values for 
their operation, whose execution reduces the stack by one 
level/ and that are assigned the next instruction 
information "U". An example of such is the virtual machine 
instruction "Add" that is assigned the next instruction 
information "U" . The equation next to each oval of 
instruction type shows changes in the TOS variable 1221 and 
the SOS variable 122 resulting from the execution of the 
virtual -machine instruction type in the oval/ with "x" 
denoting an operand . 

In this figure, any instruction belonging to an 
instruction type from which an arrow starts can be executed 
prior to the execution of any instruction belonging to the 
other instruction type indicated by the arrow. Different 
operations that can be executed prior to the execution of a 
next instruction are distinguished by arrows. Hereafter, 
these operations, which can be performed prior to the 
execution of the nest instruction, are called preceding 
operations. In Fig, 42, all arrows that start at a same 
instruction type are the same type. After the execution of 
an ins*-"*- -t ion belonging to the instruction type "2, -1(D)", 
for instance, a preceding operation shown by the arrow 
indicating "Pop SOS" can be executed before a next 
instruction which belongs to one of the following six 
instruction types: M 2,-l(U) ff , '"2, -1(D)", "l/0(U)" r 
"l f 0(D)2, f "1,-1 (U)", and "1, -1(D)". These operations 
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"Pop SOS" pops the value at the top of the memory stack 123 
onto the SOS variable 122. Note that the unconditional 
branch instruction "fir" and the end instruction "Stop" are 
represented by "1,0(0)" or "1,0(0)", which indicate that an 
5 empty operation is performed for a value stored in the TOS 

variabl «, i 71 . 

In this way, this state transition diagram can be 
thought of as showing analyzing results which indicate the 
preceding operations for each virtual machine instruction of 
10 the virtual machine 100. These analysis results are 

reflected in the microprograms stored in the microprogram 
storing unit 111, so that preceding operations (shown by the 
different types of arrows) are included in the corresponding 
microprograms . 

15 

Operation of Virtual Machine : 

The following explains the processing of the virtual 

machii^ ICC whose construction has been explained above. 

Fig. 43 is a flowchart showing the processing of the 
20 decoding unit 103 of this virtual machine. This figure 

corresponds to Fig. 6 in the description of the prior art. 

By comparing Figs. 43 and 6, it' can be observed that the 

processing flow of this decoding unit 103 is basically the 

same as that of the conventional decoding unit 4 402, except 
25 that a new step (step 4906) has been added and that specific 

contents of the processing to search the decode table (step 
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4907) are different. In the new step, the next instruction 
information storing unit 101 reads next instruction 
inform**? on from the next instruction storing unit 101 in 
synchronization with the instruction reading unit 105. 
5 Figs. 44 and 45 are flowcharts respectively showing 

former and latter halves of the detailed processing for 
searching the decode table 108 shown in step 4907 in Fig_ 
43* This figure corresponds to Fig* 7 in the description of 
the prior art* As can be seen by comparing Fig* 7 with 
MLO Figs. 44 and 45, the processing for searching tables in the 

ftp present embodiment differs; from the conventional art in that 

ijBl the following steps are newly added* The search unit 106 

^5 refers to not only an opcode of a virtual machine 

^ instruction outputted from the instruction reading unit 105 

©is but ale- the next instruction information outputted from the 

■jO next instruction information reading unit 104 (steps 5003 

; Jf| and 5007) . The search uniL 106 then finds an entry 

" : ** s corresponding to the combination of the opcode and. the next 

instruction information from the decode table 108 when "Yes" 
20 is given in steps 5008 and 5009, refers to a jump address 
108C and a number of operands 108D, and outputs them as 
decoded data to the executing unit 110. 

Fig. 4 6 shows decoded data to be outputted 
successively to the executing unit 110 when the next 
25 instruction information and the virtual machine instructions 

are stored in the next instruction information storing unit 
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101 and the instruction storing unit 102 as shown in Figs. 
36A and 36B, respectively. Fig. 4 6 corresponds to Fig. 10C 
in the description of the prior art* As shown in the 
figure, jump addresses to microprograms that correspond to 
5 combinations of the next instruction information and a 

virtual machine instruction are outputted. 

The processing of the executing unit 110 is basically 
the same as that of the prior art shown in Fig. 9. That is, 
the executing unit 110 initializes the PC 107 and the SP112 

10 (step 4702) and repeats the following processing from steps 

4703-4707, where the executing unit 110 reads decoded data 
transitu. from the decoding unit 103 (step 4704) and 
branches to a microprogram specified by a jump address 
included in the decoded data for its execution (step 4705) . 

15 Figs. 47A and 47B show the states of the PC 107, the 

SP 112, the TOS variable 121, the SOS variable 122, and the 
stack 4420 before and after the execution of the each 
virtual machine instruction when the executing unit 110 
executes the virtual machine program shown in Fig. 36B. 

20 This figure corresponds to Figs. 11A and 11B, or Figs. 13A 

and 13B in the description of the prior art. A set of next 
instruction information and a virtual machine instruction to 
be executed is shown on the left and right of a slash "/", 
within a transition arrow pattern. The calculation of the 

25 arithmetic expression "2* (34-4) " is completed when PC 4404 

indicates "9", as in the description of the prior art. 
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The characteristics observed in states shown in Figs, 
47A and 47B are, for instance, that after the execution of 
the virtual machine instruction "U/Puah 3", the value in the 
SOS variable 122 has already been stored on the top of the 
memory stack 123, or that after the execution of the virtual 
machine instruction "D/Push 4", contents of the SP 112 and 
the memory stack 123 have not changed. These are the result 
of the execution of the preceding operations based on the 
analv.;.:: ..:;:;;n by the state transition diagram described 
above . 

Figs. 48-51 show pipeline flows of the real machine 
201 when the virtual machine 100 of the present embodiment 
executes a part of the virtual machine program show in Fig. 
36B, more specifically microprograms shown in Figs. 41B and 
40B, that respectively correspond to jump processing of the 
latter half of the virtual machine instruction "Add" 
assigned "D" with address "7" and multiplication processing 
of the first half of the instruction "Mult" assigned "D" 

o 

with address "8". Figs. 48 and 49 show the cases when one 
clock cycle is required before using a value obtained 
throuyn xiL^inory reference (MEM) for a standard machine and a 
superscalar machine, respectively. Figs. 50 and 51 show the 
cases requiring two clock cycles for a standard machine and 
a superscalar machine, respectively. These four figures 
correspond to Figs. 23-26 for the first conventional 
technique . 
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This series of microprograms shown in Figs. 12D and 
12B contain two significant true dependencies between 
instructions. The first exists between instruction "Load" 
for reading a jump address and instruction "Jump" for 

5 jumping to that address. These instructions are included in 

the micjLoprogram for jump processing shown in Fig. 12D 
corresponding to a virtual machine instruction "Add". The 
second true dependency exists between instruction "Load" for 
reading a variable from the memory stack and "Mult" for 

10 multiplication processing. These instructions are included 
in the microprogram shown \ri Fig. 12C corresponding to a 
virtual machine instruction ■'Mult" for multiplication 
processing. 

In the pipeline flow shown in Fig* 48, the processing 
15 is only disturbed by one instruction cancellation caused in 

relation to the execution of : the preceding real machine 
instruction "Jinp", so that the whole processing is completed 
in 11 cycle clocks. As can be seen by comparing this flow 
with that of Fig. 23, the execution speed of this virtual 
20 machine is the same as that of the conventional virtual 

machine described in the first conventional technique when 
the real machine 201 is a standard machine capable of using 
a memory reference value one clock cycle after a memory 
reference. 

25 In the pipeline flow shown in Fig. 49, the first and 

the second data dependencies described in the first 



72 



conventional technique are absorbed by the virtual machine 
100 of the present embodiment. As a result, this pipeline 
flow is L.iiy disturbed by three instruction cancellations 
caused in relation to the execution of the preceding real 
5 machine instruction "Jtap rl", so that the whole processing 

is completed in 9 clock cycles. As can be seen by comparing 
this figure with that shown in Fig. 24, when the real 
machine 201 is a superscalar machine capable of using a 
memory reference value one clock cycle after a memory 
^10 reference, the virtual machine 100 of the present embodiment 

;iV has an execution speed 22& higher than that of the virtual 

m machine described in the first conventional technique that 

requires 11 clock cycles. 
^ in the microprogram corresponding to the virtual 

U;15 machine instruction "Add", instructions for the preceding 
O operations, which are "Lead r4,[r2]" and "Doc r3", for the 

ijj next virtual machine instruction "Mult" are executed, and as 

a result, a sufficient time is secured between a memory 
reference (Load rl, [r2] ) and a branch {3tap rl) so that the 
20 disturbance in the pipeline flow is absorbed. Here, "Load 

r4,[r2]" and "Dec r3» for the preceding operations denote 
the popping from the memory stack 123 to the SOS variable 
122 and a decrementing of the SP 112, respectively. 

In the pipeline shown in Fig. 50, for the same reason 
25 described above, with the virtual machine 100 of the present 

embodiment/ the first and the second data dependencies 
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described in the first prior art are absorbed* As a result, 
the pipeline flow is only disturbed by the cancellation of 
one instruction necessitated by the execution of the 
preceding real machine instruction "Jmp rl", so that the 

5 whole processing is completed in 11 clock cycles. As can be 

seen by comparing this pipeline flow with that shown in Fig, 
25, when the real machine 201 is a standard machine capable 
of using a memory reference value two clock cycles after a 
memory reference, the virtual machine 100 of the present 

0 embodiment has a performance speed 18% higher than that of 

the conventional virtual machine described in the first 
conventional technique that requires 13 clock cycles. 

In the pipeline shown in Fig. 51, a number of hazards 
caused by the first data dependency decreases and the second 

5 data dependency is absorbed by the virtual machine 100 of 

the present embodiment. As a result, the pipeline flow is 
only disturbed by a hazard for one clock cycle resulting 
from the first data dependency and by the cancellation of 
five instructions due to the execution of the preceding real 

!0 machine instruction "Jnp rl", so that the whole processing 

is completed in 10 clock cycles. As can be seen by 
comparing this pipeline flow with that shown in Fig. 26, 
when the real machine 201 is a standard machine capable of 
using a memory reference value two clock cycles after a 

5 memory reference, the virtual machine 100 of the present 

embodiment has a performance speed 30% higher than that of 
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the virtual machine described as the first conventional 
technique that requires 13 clock cycles. 

As has been described, the virtual machine 100 of the 
present embodiment executes a virtual machine instruction by 
5 referring to the corresponding next instruction information 

and performing stack handling, which is a preceding 
operation for the execution of the immediately following 
virtual machine instruction, between executions of two real 
machine instructions that have a true dependency with one 
O10 another. 

rjp Construction of the virtual Machine Compiler 

^ The following explains a virtual machine compiler for 

the above virtual machine 100. 

Fig. 52 is a block diagram showing the construction 
O of a virtual machine compiler 3400 for the above virtual 

# machine 100. The input to this virtual machine compiler is 

^ a source program 3404 written in a high-level language. The 

virtual machine compiler 3400 is a cross compiler for 
20 generating a virtual machine program 34Q5A composed of the 

specific virtual machine instructions shown in Fig. 2 of the 
above virtual machine 100 and sets of next instruction 
information 3405B that correspond to the virtual machine 
instructions. This virtual machine compiler 3400 includes 
25 an instruction sequence converting unit 34 02, a next 

instruction information generating unit 3401, and a relation 
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associating unit 3403. 

The instruction sequence converting unit 3402 
receives the source program 3404 via a circuit S from the 
network card 208 or the hard disk 207, and performs 
5 syntactic analysis to convert the source program 3404 into a 

virtual ra chine instruction sequence containing virtual 
machine instructions specific to the above virtual machine 
100, The instruction sequence converting unit 3402 
successively outputs the converted virtual machine sequence 
^10 to the next instruction information generating unit 3401 and 

U the relation associating unit 3403 via circuits CI and C3, 

p The next instruction information unit 3401 receives 

7i virtual machine instructions from the instruction sequence 

^ converting unit 3402, specifies a set of next instruction 

□HIS information for each virtual machine instruction, and 

3 outputs the specified sets of next instruction information 

0 to the relation associating unit 3403 in order via a circuit 

= c2 - Tne ' instruction sequence converting unit 3402 and the 

next instruction information generating unit 3401 adjust 
20 timing for outputting the virtual machine instructions and 

the next instruction information so that inputs of a virtual 
machine instruction and a corresponding set of next 
instruction information to the relation associating unit 
3403 are synchronized. 
25 Th€ relation associating unit 3403 associates a 

virtual machine instruction outputted from the instruction 
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sequence converting unit 34 02 with a corresponding set of 
next instruction information outputted from the next 
instruction information generating unit 34 01 as a pair, and 
outputs each virtual machine instruction and next 
5 instruction information to a storage area like the memory 

202 or the hard disk 207 as a final output program 3405 via 
circuits Dl and D2 . 

Figs. 53 and 54 shows data constructions of the 
source program 34 04 to input to the instruction sequence 
;g0 converting unit 3402 via the circuit S. Fig. 53 shows a 

tree construction corresponding to an instruction sequence 
33 "x;=(l+2) * (3+4)* of the source program 3404, and Fig. 54 

ip shows a data construction of each node constituting the 

tree. A node corresponds to each instruction making up the 
'^15 instruction sequence in the source program 3404, and 

!i contains an instruction type 5201, a pointer to left sub- 

ip tree 5202, and a pointer to right sub-tree 5203. 

Operation of Virtual Machine Compiler 
20 The following describes the processing of the virtual 

machine compiler 3400 that processes the source program 3404 

that has the data construction described above. 

Fig, 55 is a flowchart showing the procedure of the 

instruction sequence converting unit 3402. The instruction 
25 sequence converting unit 3402 reads an instruction sequence 

of the source program 3404 represented by the tree structure 
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(step 5402) and initializes a calculation stack used to 
track each branch of the tree construction (step 5403) . The 
instruction sequence converting unit 3402 then sets a 
pointer to a root node in the variable ptr (step 5404), 
generates a virtual machine instruction sequence/ i.e. 
virtual machine code corresponding to the instruction 
sequence represented in the tree construction (step 5405), 
and outputs it in units of bytes to the next instruction 
information generating unit 3401 and the relation 
associating unit 3403. 

Fig. 56 shows the detailed processing of step 5405 in 
Fig, 55. The instruction converting unit 3402 repeats the 
following processing, where a node placed on the left branch 
is processed (steps 5603-5606) before a node on the right 
branch (steps 5607-5610). Numerical values and addresses 
included in instruction types 5201 are outputted as they 
are, and other codes are outputted after being converted to 
a corresponding virtual machine code (steps 5611-5613). 
Note that the processing from steps 5601-5614 is invoked on 
a recursive call in steps 5605 and 5609 so that this 
processing is repeated for all the nodes contained in the 
tree construction. 

Figs. 57A-57D are flowcharts showing the detailed 
processing of step 5613 in Fig." 56. These flowcharts 
correspond to the source program shown in Fig. 53. The 
instruction sequence converting unit 3402 generates either 
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"Push', -Mult", "Pop", or "Add* opcode of a virtual machine 
instruction according to a stored value in the variable knd 
of each instruction type 5201 of a node (steps 5901-5909) . 

Fig. 58 is a flowchart showing the processing of the 
5 next instruction information generating unit 3401. The next 
instruction information generating unit 3401 receives 
virtual machine codes, which are successively outputted from 
the instruction sequence converting unit 3402, in units of 
bytes (step 5502), and judges whether each virtual' machine 
glO code except for the virtual machine code sent using the 
Uj first one byte is an operand, "Push" opcode of a virtual 

ff machine instruction, or an other opcode. The next 

P instruction information generating unit 3401 then specifies 

" a set of next instruction information Next corresponding to 

the virtual machine code and outputs the information N*xt to 
the relation associating unit 3403 (steps 5503-5509). Here, 
a set of next instruction information to be output last is 
fixed as "U" (step 5510) , 

Fig. 59 shows the processing of the relation 
associating unit 3403. The relation associating unit 3403 
initializes a variable prv that stores a set of next 
instruction information of a virtual machine instruction 
processed immediately before and an address Addr of a 
virtual machine code and associated next instruction 
information to be generated (step 6002) . The relation 
associating unit 3403 then repeats the following processing 
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(steps 6004-6010) until it judges that there are no virtual 
machine code to be read from the instruction sequence 
converting unit 3402 (step 6003) . 

The relation associating unit 3403 receives a 1-byte 
virtual machine code and the corresponding next instruction 
information Next from the instruction sequence converting 
unit 3402 and the next instruction information generating 
unit 3401 via the circuits CI and C2, respectively (steps 
6004 and 6005) . The relation associating unit 3403 then 
judges whether the next instruction information Next is M X" 
indicating that the prcscht next instruction information is 
the same as the immediately preceding information (step 
6006), and determines the next instruction information now 
of the virtual machine code (steps 6007 and 6008) . 
Following this, the relation associating unit 3403 outputs 
the determined next instruction information now and the 
virtual machine code as a pair to a location specified by 
the address Addr in a storage area, such as the memory 202, 
(step 6009) and prepares for the processing of the next 
virtual machine code (step 6010) . 

In this way, the virtual machine compiler 3400 of the 
present embodiment generates a virtual machine program used 
for the virtual machine 100 of the present embodiment from 
the source program 3404 written in high-level language . 
This generated virtual machine program contains a virtual 
machine instruction sequence and sets of next instruction 
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information, to be respectively stored in the instruction 
storing unit 102 and the next instruction information 
storing unit 101 of the virtual machine 100 shown in Fig. 
35. 

Here, note that input to the virtual machine compiler 
3400 of the present embodiment is not limited to a source 
program represented with a tree construction such as the 
source program 3403, but may be text written in a 
programming language such as C. in such a case, the 
instruction sequence converting unit 3402 may perform a 
preceding operation to convert the text to intermediate code 
using a tree construction or a three-operand method. 

Second Embodiment 

The following describes the virtual machine of the 
second embodiment, which execution rate is not affected by 
an interrupt processing. 

Construction of the Vi rtual Machine 

Fig. 60 is a block diagram showing the construction 
of the virtual machine 3500 of the present embodiment. This 
virtual machine 3500 includes ah instruction storing unit 
4401, a decoding unit 3502, an interrupt controlling unit 
3510, an executing unit 4410, and a stack 4420. 

As can be seen by comparing Fig. 60 with Fig. 1, this 
virtual machine 3500 includes basically the same elements as 
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the conventional virtual machine 4400. These elements in 
the two figures are assigned common numbers, and explanation 
of these elements will be omitted here. 

The differences between the conventional virtual 

5 machine 4400 and this virtual machine 3500 are as follows. 
First, in addition to the. elements included in the 
conventional machine 4400, this virtual machine 4400 
includes an interrupt controlling unit 3510 for controlling 
and executing processing that corresponds to an interrupt 

0 request to this virtual machine 3500. Secondly, the 

decodina unit 3502 outputs^ a control signal and decoded data 
to the branch instruction detecting unit 3505- Finally, the 
microprogram storing unit 4411 of the executing unit 3515 
newly stores an interrupt handling program 3516 which is a 

5 real machine program for interrupt handling. The following 

explanation focuses on these new aspects of the virtual 
machine 3500 that are not included in the conventional 
virtual machine 4400, 

The interrupt controlling unit 3510 detects if there 

!0 is an interrupt request every time the virtual machine 3500 

decodes and executes a branch instruction, and controls the 
processing to have the executing unit 4410 perform necessary 
interrupt handling. The interrupt controlling unit 3510 
includes a branch instruction detecting unit 3505, an 

5 interrupt instruction inserting unit 3506, and an interrupt 

state storing unit 3507. 
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The branch instruction detecting unit 3505 receives 
decoded data from the search unit 4405 via a signal line Dl, 
and judges if the received decoded data is a jump address of 
a microprogram corresponding to one of the following branch 
5 instructions of "Br", "Brz", "Brnz", "Call", and "Rat". If 

so, the branch instruction detecting unit 3505 turns on the 
signal line C2 and outputs the decoded data to the interrupt 
instruction inserting unit 3506, and if not, outputs the 
data with the signal line C2 left off. 
p|° The interrupt state storing unit 3507 is a storage 

^ area to store a state variable ID for specifying if an 

B interrupt request to the virtual machine 3500 exists and, if 

U SO/ a type of the interrupt. This interrupt state storing 

unit 3507 is physically allocated to a register of the 
p5 memory 202 or the network card 208, for instance. 

J; The interrupt instruction inserting unit 3506 is 

notified via the signal line C2 that the branch instruction 
detecting unit 3505 has detected a branch instruction. The 
interrupt instruction inserting unit 3506 then checks if 
there i:> an interrupt request by referring to the state 
variable ID stored at that point in the interrupt state 
storing unit 3507. If there is* an interrupt request, the 
interrupt instruction inserting unit 3506 outputs the state 
variable ZD and the decoded data for having an interrupt 
handling performed which is a jump address of the interrupt 
handling program 3516 of the microprogram storing unit 4410. 
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This output is performed preceding the output of another 
decoded data for the detected branch instruction. 

The interrupt handling program 3516 is a real machine 
program that reads an interrupt vector stored in the address 
5 on the memory 202 based on the state variable ID output ted 

from the interrupt instruction inserting unit 3506, and 
processes a subroutine in a location indicated by the 
interrupt vector. 

r! i 10 Operation of Virtual Machine 

jry! The following describes the processing of the virtual 

rip machine 3500 that has the above construction. 

Fig. 61 is a flowchart showing the detailed 
processing for outputting decoded data and searching the 

©15 table by the decoding unit 3502. This figure corresponds to 

„ — - 

O Fig. 7 in the description of the prior art, 

rh= difference between these flowcharts lies in the 
~' processing for outputting decoded data (steps 6108-6111). 

That is / the search unit 4405 reads a jump address 
20 corresponding to an opcode of a virtual machine instruction 

outputted from the instruction reading unit 4403 (step 
6106) , and outputs the read jump address as decoded data to 
the branch instruction detecting unit 3505 via a signal line 
Dl with a signal line Cl on (steps 6108-6110) . 
25 Fig. 62 is a flowchart* showing the processing of the 

branch instruction detecting unit 3505. The branch 
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instruction detecting unit 3505 reads decoded data via a 
signal 1 f ~2 Dl, stores it temporarily as ddata (steps 6202- 
6203) , and judges if the decoded data ddata is a jump 
address of a microprogram by referring to a state of the 
5 signal line CI (step 6204). If so, the branch instruction 

detecting unit 3505 also judges if the jump address is for a 
microprogram corresponding to one of the branch instructions 
"Br", "Brz", "Broz", "Call", and "Ret" that are stored in 
the branch instruction detecting unit 3505 in advance (step 
~10 6205) . If so, the branch instruction detecting unit 3505 

y= turns the signal line C2 oh (step 6206) and outputs the 

B decoded data ddata, which has been temporarily stored (steps 

pi 6206-6208) . if not, the decoded data ddata is outputted 

with the : signal line C2 turned off (steps 6207-6208) . 
E^ 15 Fig. 63 is a flowchart showing the processing of the 

^ interrupt instruction inserting unit 3506. The interrupt 

Qf instruction inserting unit 3506 reads decoded data via the 

signal line D2, stores it as ddafca2 temporarily (steps 6302- 
6303), and judges if the read decoded data ddata2 is a jump 
address of a microprogram corresponding to one of the above 
branch instructions referring to a state of the signal line 
C2 (step 6304) . If so, the interrupt instruction inserting 
unit 3506 reads a state variable XD from the interrupt state 
storing unit 3507 (step 6305), and judges if an interrupt 
has been generated by referring to the state variable ID 
(step 6303). if so, the interrupt instruction inserting 
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unit 3506 outputs the state variable ID and the jump address 
of the interrupt handling program 3516 to the executing unit 
4410 as decoded data for having a predetermined interrupt 
handling performed (step 6307) . Following this, the 

5 interrupt instruction inserting unit 3506 outputs other 

decoded data ddata2 for the branch instruction that has been 
temporarily stored to the executing unit 4410 {step 6308), 
As a result, the executing unit 3515 executes the interrupt 
handling program 3516 based on the state variable ID prior 

0 to the execution of the branch instruction. 

the other hand, ^. if the interrupt instruction 
inserting unit 3506 judges that the decoded data inputted 
via the signal line D2 is not a jump address of a 
microprogram for a branch instruction (step 6304), or that 

5 no interrupt has been generated (step 6306), then the 

temporarily stored decoded data ddata2 is simply outputted 
to the executing unit 4410 (step 6308) . 

In this way, the virtual machine 3500 of the present 
embodiment checks whether an interrupt has occurred to the 

!0 virtual machine 3500 each time it decodes and executes a 

branch virtual machine instruction, and if so, interrupt 
handling is additionally performed. 

Compared with a conventional virtual machine 4400, 
the virtual machine 3500 of the present embodiment needs to 

!5 execute one extra branch instruction for interrupt handling 

each time a virtual machine branch instruction is executed. 



as a result/ the number of accesses to memory increases by 
one for each virtual machine branch instruction. However, 
in a normal machine program, an average of six non-branch 
instructions exist between branch instructions, so that the 
increased number of accesses to the memory for one 
instruction becomes less than 0*2. Accordingly, by using 
the above interrupt handling function of the present 
embodiment for the virtual machine 100 of the first 
embodiment, the number of accesses to the memory can be 
reduced as a whole, and a virtual machine with an interrupt 
handling function and improved performance speed can be 
achieved without overriding the effect of the TOS variable * 

As has been described, the virtual machine 3500 of 
the present embodiment includes the interrupt controlling 
unit 3510 between the decoding unit 3502 and the executing 
unit 4 410, and interrupt detection and handling are carried 
out only when the branch instruction detecting unit 3505 
decodes and executes a virtual machine branch instruction. 
Accordingly, an interrupt detection is only performed at a 
more suitable frequency, and decreases in performance 
efficiency can be suppressed more than when interrupt 
detecting and handling are performed for every instruction 
execution. 

Note that, for the present embodiment, a virtual 
machine instruction is detected by monitoring decoded data 
transmitted from the decoding unit 3502, although this 
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detection may be achieved by monitoring each opcode of a 
virtual machine instruction inputted to the decoding unit 
3502, 

Instead of monitoring decoded data sent from the 
decoding unit 3502 to find a virtual machine branch 
instruction, the procedure of the interrupt instruction 
inserting unit 3506 may be provided to microprograms in the 
microprogram storing unit 4411 that correspond to branch 
instructions. This provides the same effect as described 
above to the virtual machine of the present embodiment. 

Third Embodiment 

The following describes a virtual machine of the 
third embodiment that can perform an interrupt handling 
while minimizing decreases in performance efficiency. 

Construction of the Virtual Machine 

lig. 64 is a block diagram showing the construction 
of the virtual machine 3600 of the present embodiment. This 
virtual machine 3600 includes an instruction storing unit 
4401, a decoding unit 3502, an interrupt controlling unit 
3610, an executing unit 4410, and a stack 4420. 

As can be seen by comparing Fig. 64 with Fig. 60, the 
present virtual machine 3600 has almost the same 
construction as the virtual machine 3500 of the second 
embodiment. The differences between the two lie in a block 



converting unit 3 605 replacing the branch instruction 
detecting unit 3505 of the second embodiment and in 
connections of the block converting unit 3605. The 
following explanation focuses on these differences between 
the present virtual machine 3 600 and the virtual machine 
3500 of the second embodiment- The block converting unit 
3605 converts the virtual machine codes decoded by the 
virtual machine 3600 into blocks, which is to say, detects 
if a predetermined number of virtual machine codes 10 byte, 
for instance, have been decoded and notifies the result of 
the detection to the interrupt instruction inserting unit 
3506. 

Operation of Virtual Machine 

The following describes the processing of the virtual 
machine 3 600 that have the above construction. 

Fig, 65 is a flowchart showing the processing of the 
block converting unit 3605. The block converting unit 3 605 
reads a set of decoded data inputted via a signal line Dl, 
temporarily stores it as ddata. (steps 6402-6403), and reads 
a value of PC 4404 at that point (step 6404), or other 
words, checks an address of a virtual machine code 
corresponding to the decoded data outputted from the 
decoding unit 3502. 

Following this, the bl-ock converting unit 3605 
divides the read PC value by a stored constant bsize to 
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generate a remainder m (step 6405), and judges if the 
remainder m is zero (step 6404) . If so, the block 
converting unit 3605 turns a signal line C2 on (step 6407) 
and outputs the ddata that has been temporarily stored 
(steps 6407-6409)- If judged not (step 6406), the block 
converting unit 3 605 outputs the stored ddata with the 
signal line C2 being left off (steps 6407-6409) . 

As in the second embodiment/ the interrupt 
instruction inserting unit 3506 only checks if an interrupt 
has occurred only when the signal line C2 is on. If so, the 
interrupt instruction inserting unit 3506 outputs another 
set of aecoded data for an interrupt handling to the 
executing unit 4410, the decoded data containing a jump 
address of an interrupt handling program stored in the 
microprogram storing unit 4411 and a state variable ID. 

In this way, an interrupt occurring to this virtual 
machine 3 600 is checked every time the virtual machine 3 600 
has decoded a predetermined number bsize of virtual machine 
codes, and if an interrupt has occurred, interrupt handling 
is additionally performed. . Accordingly, an interrupt 
detection is performed only once for a block of virtual 
machine codes whose number is specified by a constant bsize. 

Accordingly, by setting a value higher than a certain 
value in the constant bsize and using the above interrupt 
handling function of the present embodiment for the virtual 
machine 100 of the first embodiment, the number of accesses 
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to the memory can be reduced as a whole, and a virtual 
machine with an interrupt handling function and improved 
performance speed can be achieved without overriding the 
effect of the TOS variable whereby a reduced number of 
memory accesses can be made. 

Although the block converting unit 3605 of the 
present virtual machine 3600 refers to the PC 4404, this 
reference does not increase the number of memory accesses 
since t^r PC 4404 is associated to register #2 <r2) of the 
real machine 201. 

Also, with the present virtual machine 3600, the 
number of memory accesses can be flexibly controlled by 
changing a value of the constant baize. 

The decoding unit of the present embodiment compares 
the constant bsize with a value of PC 4404 corresponding to 
decoded data sent from the decoding unit 3502, although the 
constant bsize may be compared with a value of an internal 
counter that is provided in the decoding unit 3502 .and 
counts a number of "on" signals on the signal line CI* In 
this case, an interrupt detection processing is performed 
for a g-t-'^up of virtual machine codes corresponding to not a 
predetermined number of bytes but a predetermined number of 
instructions . 

With the present embodiment, the interrupt 
controlling unit 3610 independently performs blocking, 
although the blocking may be performed by the executing unit 
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4410 referring to the PC 4404, if the procedure of the 
interrupt controlling unit 3610 is additionally stored in 
the microprogram storing unit 4411. 

5 Fourth Embodiment 

following describes the virtual machine of the 
fourth embodiment. This virtual machine is highly 
independent of the architecture of a real machine. 

Oo Construction of the Virtual Machine 

\W Fig. 66 is a block-diagram showing the construction 

p of the virtual machine 3700 in this fourth embodiment. This 

virtual machine 3700 includes an instruction storing unit 
™- 3701, a decoding unit 4402, an executing unit 3710, and a 

35 stack 4420* 

5 As be seen by comparing Fig, 66 with Fig, 1, the 

| present virtual machine 3700 has almost the same 

^ construction as the conventional virtual machine 4400, The 

differences between the two lie in the content of the 
executing unit 3710 t in the executing unit 3710 being 
provided with the area judging unit 3704 and the address 
converting unit 3705, and in th£ provision of the real 
machine function storing unit 3706. The following 
explanation focuses on these differences between the present 
virtual machine 3700 and the conventional virtual machine 
4400. 
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The real machine function storing unit 3706 stores a 
set of the functions (called "real machine functions") that 
are inr!V^7.d in real machine instructions beforehand* In 
other words, the real machine function storing unit 3706 
stores a set of the functions that execute the routine 
processing required by virtual machine programs as an 
execution library. This real machine function storing unit 
3706 is physically assigned to an area in the memory 202. 
As one specific example, a total of machine 
functions numbered from the 0 th to the (KM^-F*^) th are 
stored. ^ 

The instruction storing unit 3701 stores not just the 
virtual machine program to be executed, but also a real 
machine function table beforehand. This real machine 
function' i:able is a set of pointers (start addresses) for 
the different real machine pointers stored in the real 
machine function storing unit 3706. 

Fig. 67 shows a memory map of the instruction storing 
unit 3701 , which is to say how different memory areas in the 
instruction storing unit 3701 are used when seen from the 
virtual machine 3700. The area between the addresses 
and is assigned to the virtual machine program 6501, 

which is to say, to an area where a set of the functions 
given in virtual machine instructions are arranged. in the 
following area between the addresses K*^, and is 
assigned to an area that stores the real machine function 
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table 6502, Note that this area of the real machine 
function table 6502 is located directly after the virtual 
machine program 6501. This means that the address is 
equal to the address V*C— +1- 
5 Fig. 68 shows the construction of the real machine 

function table 6502 shown in Fig. 67. In the area of the 
instruction storing unit 3701 with the addresses I&W-ra^, 
pointer to the real machine functions numbered 0- (RM-^-RM^J 
are given. However, these pointers are stored in reverse 

O10 order to the assignment of addresses. As one example, the 

jy 0 th real machine function i^rthe function executed when the 

virtual iiLdchine function located at the address is 
called. Similarly, the (EO^-ra^) tn real machine function is 

W the function executed when the virtual machine function 

;Ql5 located at the address is called. 

^3 The area judging unit 3704 oversees the decoded data 

g output ted by the decoding unit 4402 and, when a function 

call instruction "Call" is to be executed by the executing 
unit 3710, judges before the function call is performed 
20 whether the called function is in the virtual machine 

program 6501 or in the area where the real machine function 
table 6502 is located. 

The address converting unit 3705 operates as follows. 
When the area judging unit 3704 judges that the virtual 
25 instruction to be executed is a function call instruction 
"Call" that calls a function in the real machine function 

94 



table 6502, the address converting unit 3705 directly has 
the real machine 201 execute a real machine function in the 
real machine function storing unit 3706 that is indicated by 
the function pointer in the real machine function table 6502 
that corresponds to the call address* 

Operation of virtual Machine 

The following describes the operation of the virtual 
machine C7G0. 

Fig* 69 is a flowchart that shows the operation of 
the executing unit 3710 in-- -the virtual machine 3700. This 
drawing focuses in particular on the operation of the area 
judging unit 3704 and the address converting unit 3705 when 
decoded data for a function call operation "Call" has been 
sent from the decoding unit 4402. 

The area judging unit 3704 oversees the decoded data 
sent from the search unit 4405 and the state of the signal 
line R* On discovering that the operand of the function 
call instruction "Call" has been sent from the decoding unit 
4402 , the area judging unit 3704 judges, before the function 
call instruction is executed, whether the call address craddr 
indicated by the operand is within a range given as the 
addresses R^^-ra^, and by doing so judges whether the call 
address is located in the area,- "that stores the real machine 
function table 6502 (steps 680'2-68Q4) . 

When the call address Jaddx is judged as being in 
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this area f the address converting unit 3705 calculates an 
index icbc for the real machine function table 6502 
corresponding to the call address Jaddr, based on the 
reverse order described above (step 6805) . The address 
5 converting unit 3705 then reads the pointer ptr stored in 

the entry of the real machine function table 6502 indicated 
by the index idx (step 6806) . The executing unit 3710 then 
directly executes the real machine function in the real 
machine function storing unit 3706 shown by the pointer ptr 

OlO in place of the original virtual machine instruction "Call" 

!!J (step 6807) . 

£B On the other hand, when the area judging unit 3704 

g judges that the call address Jaddr of the function call 

instruction "call" is not in the same area as the real 
: ^15 machine function table 6502, the executing unit 3710 

O proceeds with the execution of a standard function call 

?# (steps 6808-6810). This means that the executing unit 3710 

stores the return address (steps 6808, 6809), and then 
executes the virtual machine function located at the call 
20 address Jaddr (step 6810) . 

In this way, when the call address Jaddr of the 
virtual machine instruction "Call" belongs to the area of 
the virtual machine program 6501, the virtual machine 
function is called as it is. However, when the call address 
25 Jaddr belongs to the real machine function table 6502, the 

corresponding real machine function is executed. 
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As can be seen from the memory map shown in Fig* 67, 
switches between executing a virtual machine function or a 
real machine function in response to a function call 
instruction "Call" can be easily achieved by shifting the 
boundary line between the areas 6501 and 6502. As one 
example, when the address Vb^^ that marks the boundary is 
lowered, the address RM»^ is also lowered, so that for a 
function call instruction "Call" with the same call address, 
a switch can be made from having a virtual machine function 
executed to having a real machine function executed. In the 
same way, when the address> that marks the boundary is 

raised, a switch can be made- from having a real machine 
function executed to having a virtual machine function 
executed. 

As described above, the virtual machine 3700 of the 
present embodiment achieves control that calls virtual 
machine functions as they are or has real machine functions 
performed in place of virtual machine functions based on the 
setting of just one parameter Vb^. This means that the 
virtual machine 3700 has a favorable architecture for a 
virtual machine that is enacted on a variety of real 
machines and computer environments. This is because before 
execution a virtual machine program can be partially 
converted into real machine functions in keeping with a 
variety of real machines and computers that have different 
architectures. Here, the division into parts executed as 
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virtual machine functions and into parts executed as real 
machine functions can be separately determined for each 
different architecture . 

In this way, no deterioration in processing speed is 
observed, and a virtual machine that is highly independent 
of the architecture of real machines can be realized. 

Note that while the present embodiment states that 
only the virtual machine program 6501 is located in the area 
between the addresses V*^^ and in the instruction 

storing unit 3701, this is not a limitation for the present 
invention. As one example^'- Fig * 70 shows that memory 
attributes for each address ("V" or "R" ) , and, corresponding 
to these attributes, data (a virtual machine program) or an 
index for the real machine function tabic may be stored- By 
doing so, it is possible to switch between executing a 
virtual machine function as it is and executing a real 
machine function in response to virtual machine functions 
that call the same address, without shifting the boundary 
line v^. 

Fifth Embodiment 

The following describes* the virtual machine system of 
the fifth embodiment of the present invention. This 
embodiment reduces the processing load for converting 
virtual machine programs into cache blocks and the time 
required by a JIT compiler to compile the virtual machine 
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program. 



Construction of Virtual Machine 

Fig. 71 is a block diagram showing the construction 
of the virtual machine 3800 in this fifth embodiment. This 
virtual machine 3800 includes an instruction storing unit 
3801, a decoding unit 3802, an executing unit 3810, and a 
stack 4420. 

As can be seen by comparing Fig, 71 with Fig* 1, the 
present virtual machine 3800 has almost the same 
construction as the conventional virtual machine 4400* The 
differences between the two lie in the content of the 
executing unit 3810, in the construction of the PC 3804, and 
in the branch destination converting unit 3811 being added 
to the executing unit 3810* The following explanation 
focuses on these differences : between the present virtual 
machine 3800 and the conventional virtual machine 4400. 

The instruction storing unit 3801 stores the virtual 
machine program to be executed split into units called 
instruction blocks. The instruction storing unit 3801 is 
composed of a plurality of instruction block storing areas 
3852a-3852d that each store an instruction block. 

In this embodiment, an instruction block refers to a 
basic block in the virtual machine program to which a unique 
identifier has been assigned and to which a branch 
instruction for continuing the logical flow of the virtual. 
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machine program has been appended. These instruction blocks 
are created by a special compiler for the virtual machine 
3800 that is described later in this embodiment. Note that 
a basic block is an instruction sequence that starts with an 
5 instruction that is the sole entry point into the basic 

block ana ends with an instruction that is the sole exit 
point from the basic block. In this embodiment, the 
identifier of an instruction block is composed of address 
information that specifies the start of the instruction 
rJO block in an instruction block storing area. 

jij The instruction block storing areas 3852a~3852d each 

g include an identifier storing area 3853a, a non-branch 

^jf instruction storing area 3854a~3854d that stores 

?M instructions/ out of the virtual machine instructions that 

Ol5 belong to the corresponding instruction block, that are not 

q branch instructions (such instructions hereafter being 

% called "non-branch instructions"), and a branch instruction 

CP storing area 3855a~3855d that stores only the branch 

instructions in the corresponding instruction block . 
20 Fig, 72 shows an example of the stored state of a 

virtual machine program that has been stored in the 
instruction storing unit 3801. ^This shows the case when the 
sample virtual machine program shown in Fig. 27 is stored. 

As shown in Fig. 72, the virtual machine program is 
25 divided into four instruction -blocks 3852a-3852d. These 

instruction blocks 3852a-3852d are composed of the 
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instruction block identifiers 38533-3853(1/ the non-branch 
parts 3854a-3854d that include all parts of the instruction 
block aside from the branch instructions, and the branch 
parts 3855a-3855d that include the branch instructions 
5 located at the end of basic blocks and the branch 

instructions used for linking instruction blocks to the 
following basic block. 

Note that the virtual machine programs shown in Fig. 
72 and in Fig. 27 have the control flow shown in Fig* 73 and 
00 so have effectively the same processing content. This 

ilif should be clear from the meanings of the virtual machine 

;5fJ instructions shown in Fig. 2. 

j^; PC 3804 includes the identifier segment register 

W 3804a and the offset counter 3804b. The identifier segment 

4335 register 3804a stores a segment address that is equivalent 

O to the identifier of the instruction block that includes the 

% virtual machine code in the instruction storing unit 3801 

© which should be read next. This segment address is 

hereafter called the "identifier segment". The offset 
20 counter 3804b stores an offset for the instruction block 

including that virtual machine code. 

Note that the present virtual machine 3800 performs 
16-bit addressing, as shown in Fig- 74, with the upper 8 
bits being the identifier segment and the lower 8 bits being 
25 the offset. This is to say, an 8-bit identifier segment is 

stored in the identifier segment register 3804a and an 8-bit 



offset is stored in the offset counter 3804b. The 16-bit 
address given by linking these together specifies one 
virtual machine code in the instruction storing unit 3801. 

The branch destination converting unit 3811 operates 
as follows. When a branch instruction is executed by the 
executing unit 3810, the branch destination converting unit 
3811 updates the instruction block identifier that is the 
branch destination using the combination of the identifier 
segment and offset, and stores the converted result in the 
PC 3804. 

Operation of Virtual Machine 

The following describes the operation of the virtual 
machine 3800. 

The decoding unit 3802 and the executing unit 3810 
operate in almost the same way as the corresponding 
components in the conventional virtual machine 4400. The 
differences between the two are that during normal 
operation, only the offset counter 3804b of the PC 3804 is 
updated by the executing unit 3810, and that when a branch 
is executed, the identifier segment register 3804a and the 
offset counter 3804b of the PC ."3804 are updated by the 
branch destination converting unit 3811- 

Fig. 75 is a flowchart-showing the operation of the 
branch destination converting -unit 3811 in the executing 
unit 3810. This branch destination converting unit 3811 
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first obtains the operand of a branch instruction, which is 
to say the 8-bit instruction block identifier Jaddr, from 
the decoding unit 3802 (step 8102). The branch destination 
converting unit 3811 sets this as the identifier segment of 
the branch destination, sets the offset as zero, and 
generates a 16-bit physical address which it uses to update 
the identifier segment register 3804a and the offset counter 
3804b of the PC 3804 (step 8103). 

Fig, 7 6 shows this address conversion by the branch 
destination converting unit 3811, where a logical address 
and identifier in the virtual machine program shown in Fig. 
72 are replaced with a physical address. As one example, 
the operand "x03" of the branch instruction "Brz" in the 
instruction block with the identifier number 1 in Fig, 72 is 
converted by the branch destination converting unit 3811 
into the physical address "X0300" at the start of the 
instruction block with the identifier number 3. 

In this way, whenever the executing unit 3810 
executes a branch instruction, the executing unit 3810 
performs control so that processing branches to the start of 
the instruction block indicated by the operand of the branch 
instruction* By doing so, the 'virtual machine 3800 decodes 
and executes virtual machine programs that have been stored 
divided into instruction blocks using effectively the same 
procedure that is used for programs that are not divided 
into instruction blocks - 
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Construction o£ the virtual Machine Compiler 

The following describes a virtual machine compiler 
for the virtual machine 3800. 

Fig. 77 is a block diagram showing the construction 
5 of the virtual machine compiler 7 660 in this fifth 

embodiment. This virtual machine compiler 7660 receives an 
input of a source program 7650 that is written in a high- 
level language such as C, and converts the source program 
7650 into a suitable form for storage into the instruction 
00 storing unit 3801 of the virtual machine 3800, this suitable 

jjj form being the instruction^ block set 7651- The virtual 

2 machine compiler 7 660 includes an intermediate instruction 

:iy sequence converting unit 7 661, a generating unit 7 662, and a 

;;jiJ block converting unit 7 663, 

The intermediate instruction sequence converting unit 
;E 7 661 performs syntactic analysis on an inputted source 

® program and develops temporary intermediate code that is 

# used for optimization. The generating unit 7 662 converts 

the intermediate code developed by the intermediate 
20 instruction sequence converting unit 7 661 into the code of a 

virtual machine program 7 664, such as that shown in Fig. 27. 
This intermediate instruction sequence converting unit 7 661 
and generating unit 7 662 have the same functions as the 
equivalent components in a standard conventional virtual (or 
25 real) machine compiler. 

The block converting unit 7 663 converts the virtual 
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machine program generated by the generating unit 7 662 into a 
set of instruction blocks that can be stored in the 
instruction storing unit 3801. when doing so, the main 
processes are the division into basic blocks and the setting 
5 of addresses in accordance with the division- This setting 

of addresses is a process whereby the branch destinations 
used by branch instructions in the virtual machine program 
7 664 are replaced with instruction block identifiers ID. 

Next, the block converting unit 7663 generates and 
310 uses a branch address conversion table 7 663a as a temporary 
7? variable table for setting; the addresses. The construction 

% of the branch address conversion table 7 663a is shown in 

Fig. 78. 

J Each row (entry) in the branch address conversion 

□il5 table 7 663a is generated corresponding to either a different 

^ branch instruction in the virtual machine program 7 664 that 

if is inputted into the block converting unit 7 663 or one of 

3^ the generated instruction blocks. In each entry: 

• "code position" shows the first address in the 

20 instruction block or an address of the branch 

instruction in the virtual machine program 7 664. 

"registration flag" is -a flag showing whether the 

address setting has been completed for the branch 

instruction- 

25 • "reference position identifier" and "reference 

position offset" show the instruction block 
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identifier and offset where 
located or where the branch 
to the instruction block is 




the branch instruction is 
instruction that branches 
located. 



5 Operation of the Virtual Machine Compiler 

Fig. 7 9 is a flowchart showing the characteristic 
operation of the virtual machine compiler 7660, which is to 
say the operation of the block converting unit 7 663. First , 
the block converting unit 7 663 resets the instruction block 

10 identifier ID of the instruction block generated as part of 

the instruction block set ?7'651, the pointer offset that 
shows the relative instruction storage position in the 
instruction block, the counter PC that shows the position of 
a one-byte virtual machine code vc that has been read in 

15 order from the virtual machine program 7 664, and the counter 

Rcourrb that shows the number of branch destinations whose 
branch addresses need to be updated (steps 7602-7603) . 

As its fundamental operation, the block converting 
unit 7 663 reads the virtual machine codes VC one byte at a 

20 time from the virtual machine program 7 664 while updating 

the counter PC. The block converting unit 7 663 outputs a 
read virtual machine code VC together with the identifier ID 
of the instruction block to which the virtual machine code 
VC should belong and the pointer offset that is a relative 

25 position in this instruction blocks as one element in the 

instruction block set 7651 (steps 7604-7611) - 



106 




When doing so, the block converting unit 7663 judges 
whether the virtual machine code VC is located at the start 
of a basic block (step 7607), and judges whether the virtual 
machine code VC is a branch instruction (step 7 608) ♦ If 
either of these judgments is affirmative f the block 
converting unit 7663 executes a special procedure (steps 
7701-7704 or step 7609). 

Fig, 80 shows the details of the judgment in step 
7607 ot cig. 79, which is to say, the judgment as to whether 
the virtual machine code VC should be made the start of a 
basic block. If the virtual", machine code VC corresponds to 
either a branch destination instruction or an instruction 
located immediately after a branch instruction, the block 
converting unit 7663 judges that the virtual machine code vc 
corresponds to the start of a basic block (step 7302-7306) - 

As shown in Fig. 7 9, ^ when the virtual machine code VC 
is judged as being the start of a basic block, the block 
converting unit 7 663 updates the identifier ID to generate a 
new instruction block (step 7701) and generates an 
unconditional branch instruction to link the end of the 
immediately preceding instruction block (identifier ID) with 
the next instruction block (identifier NID) (step 7702). 
The block converting unit 7 663 then prepares for the 
generation of virtual machine ; codes in the new instruction 
block (step 7703), and sets addresses in accordance with the 
setting of the identifier NID (step 7704) . 
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On determining in step 7 608 that the virtual machine 
code VC is a branch instruction, the block converting unit 
7663 performs address setting to convert the branch 
destination of the branch instruction to a suitable address 
(step 7609) . This address setting is performed because the 
processing of branch instructions and addition of new branch 
instructions by the block converting unit 7 663 results in a 
rearrangement of the virtual machine instructions in the 
original virtual machine program 7 664. 

Fig. 81 shows the details of step 7704 in Fig. 79, 
which is to say the settir^rof addresses in accordance with 
the allocation of the identifier NID of a new instruction 
block. Here, on discovering that the branch address of a 
branch iusLruction may now be set in accordance with the 
allocation of the identifier NID to a new instruction block, 
the block converting unit 7&63 sets the branch address for 
the branch instruction (steps 7905-7910). When this is not 
the case, the block converting unit 7663 additionally 
registers information into the branch address conversion 
table 7663a so that the address of a branch instruction that 
branches to this instruction block can be set in a later 
process (steps 7913,7914). 

Fig. 82 shows the details of step 7609 in Fig. 79, 
which is to say the setting of an address of a branch 
destination that is indicated' by a branch instruction in the 
virtual machine program 7664. Here, when the branch 
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instruction is a branch to a preceding position, which is to 
say, a branch to an instruction block that has already been 
regist*^^ in the branch address conversion table 7663a, the 
block converting unit 7 663 sets the address by replacing the 
branch destination of the branch instruction with the 

instruction block identifier rID (steps 7802-7809, 7812). 
When this is not the case, the block converting unit 7 663 

registers a new entry in the branch address conversion table 

7 663a to show that the address has not been set (steps 7810, 

7811) . 

As described above^Tthe virtual machine compiler 7 660 
converts a source program written in a high-level language 
into a standard virtual machine program 7664 like that shown 
in Fig* 27, divides the virtual machine program 7664 into 
basic blocks, and allocates identifiers to the basic blocks. 

The virtual machine compiler 7 660 then adds branch 
instructions for linking the basic blocks and sets addresses 
in accordance with the allocation of identifiers so as to 
convert the virtual machine program 7664 into an instruction 
block set 7 651 that can be executed by the virtual machine 
3800 of the present embodiment . 

Considerations 

With the virtual machihe 3800 and the virtual machine 
compiler 7 660 of the present Embodiment, the virtual machine 
program to be executed will not be stored in the instruction 
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storing unit 3801 and executed in the conventional state 
shown in Fig. 27. Instead, the virtual machine program 
executed having been stored in the instruction storing unit 
3801 divided into basic blocks. This has the technical 
5 consequences described below. 

First, let us examine the time taken by compiling by 
a JIT compiler* 

As described above, a conventional JIT compiler needs 
to analyze whether any branch destination in the virtual 
pO machine program violates certain restrictions. If such a 
^ branch destination is present, a JIT compiler needs to 

;:l£ perform ^ ^o-ocess, such as moving the branch destination. 

W However, with the present virtual machine system, it is 

4i| guaranteed that each branch destination will be the start of 

"Ol5 an instruction block. As a result, such conventional 

%| processing of branch destinations is largely unnecessary if 

Lhe present invention is used* 
# A conventional JIT compiler also needs to perform 

processes due to the presence of instructions like delayed 
20 branches. An example of such a process lor a delayed branch 

is the specifying of instructions that are unaffected by the 
delayed branch and so can be located immediately after the 
branch instruction. However, with the present virtual 
machine ' system, the virtual machine program is stored in the 
25 instruction storing unit 3801 'so that each instruction block 

is divided into a non-branch instruction storing area and a 
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branch instruction storing area. It is also guaranteed that 
in one branch instruction storing area, one branch 
instruction can only be followed by one more branch 
instruction at most- As a result, most of such processes 
5 that are required due to the presence of delayed branches 

and the like do not need to be performed with the present 
invention. 

The following describes the impact of the present 
invention with regard to the compatibility of programs to 

10 the cache construction of a virtual machine. 

When making program's" compatible with a conventional 
cache construction, it is necessary when dividing the 
virtual machine program into cache blocks to check that none 
of the virtual machine instructions that change the program 

15 counter change it to a value that crosses a boundary with 

another cache block. However, with the present virtual 
machine system, if the virtual machine program stored in the 
instruction storing unit 3801 is cached in instruction block 
units, all virtual machine instructions that change the 

20 program counter to a value that crosses a cache boundary 
will belong to a branch instruction storing area 
3855a~3855d. 

Fig. 83 shows the relationship between the PC 3804, 
the instruction block storing , areas 3852a-3852d and the 
25 cache table 8084 when caching'is performed by the virtual 

machine 3800 of the present embodiment in instruction block 
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units- This corresponds to the conventional art shown in 
Fig- 32. Conventionally, an ten-address instruction 
sequence 6903 is placed in the instruction cache 6902 as a 
cache block. With the present virtual machine 3800, 
5 however, instructions are arranged into the instruction 

cache i_j -nits of instruction blocks 3852a-3852d, with these 
being managed using the identifiers in the cache table 8404, 
as shown in Fig* 83. 

Fig. 84 is a flowchart showing the instruction 
QlO processing of branch instructions by the executing unit 3810 

fj when instructions are cached: in instruction block units in 

I5J the virtual machine 3800 of the present embodiment. This 

^ corresponds to the Fig. 75 where units are not reconciled to 

^ the cache construction. By comparing these drawings, it can 

Ol5 be seen that the virtual machine 3800 can be made into a 
,Q suitable virtual machine for, the cache construction by 

referring to the identifiers in the cache table 8404 and 

; '- : -y* 

judginy 'm instruction block units whether a cache hit is 
made (step 8504), and then performing a write into the 
20 instruction cache 8402 when there is a cache miss (step 

8505) . 

In this way, by caching a virtual machine program in 
instruction block units, processes that were conventionally 
necessary, such as judgments regarding the cache boundaries, 
25 are no longer required. Even' when an instruction needs to 

be loaded into the cache as a result of a cache miss, the 
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original virtual machine program will already has been 
divided into instruction blocks, so that there is a reduced 
load f^_- loading process. 

As described above, the virtual machine system of the 
present embodiment converts a source program into a standard 
virtual machine program and then divides the virtual machine 
program into instruction blocks using basic blocks as units. 
These instruction blocks are stored in the instruction 
storing unit 3801 and the branch destinations of each branch 
instruction are converted into the identifiers of 
instruction blocks* As a ^result, the address analysis 
processing for branch destination instructions by a JIT 
compiler is simplified, and the timing taken by compiling is 
reduced. By caching instructions in instruction block 
units, cue judgment processing regarding the cache 
boundaries is simplified, and decreases in execution 
efficiency that occur when a cache is provided for the 
virtual machine can be made smaller than in conventional 
techniques. 

Note that while the virtual machine compiler 7 660 of 
the present embodiment is provided with an intermediate 
instruction sequence converting unit 7 661 and a generating 
unit 7 662, it should be obvious that a standard compiler for 
generating a virtual machine program from a source program 
may be used instead. 
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Sixth Embodiment 

The following describes the virtual machine of the 
sixth embodiment. This virtual machine has a faster 
decoding process than the virtual machine of the fifth 
embodiment . 

Construction of the Virtual Machine 

Fig. 85 is a block diagram showing the construction 
of the virtual machine 3900 in this sixth embodiment. This 
virtual machine 3900 includes an instruction storing unit 
3901 , a decoding unit 3 9025 "-.'"an executing unit 3910/ and a 
stack 

As can be seen by comparing Fig, 85 with Fig. 71, the 
present virtual machine 3900 has almost the same 
construction as the virtual machine 3800 of the fifth 
embodiment. The differences^ between the two lie in the 
stored content of the instruction storing unit 3901, in the 
provision of the current flag storing unit 3907 in the 
decoding unit 3902, in the functions of the instruction 
reading unit 3903, and in the addition of the current flag 
read control unit 3912 to the executing unit 3910. The 
following explanation focuses on these differences between 
the present virtual machine 3900 and the virtual machine 
3800 ox che fifth embodiment- 

The instruction storing unit 3901 stores the virtual 
machine program to be executed split into a plurality of 
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instruction blocks 3952a~3952d, in the same way as the 
instruction storing unit 3801 in the fifth embodiment. 
However, the instruction block storing areas 3952a~3952d of 
the sixth embodiment differ in further including decoded 
instruction sequence storing areas 3956a~3956d for storing 
decoded data sequences that correspond to all of the virtual 
machine codes stored in the non-branch instruction storing 
areas and branch instruction storing areas (collectively 
called + "virtual machine code area") of the corresponding 
instruction block. 

Figs. 8 6A to 8 6C shows examples of the stored state 
of virtual machine programs in the instruction storing unit 
3901. These correspond to the case when the sample virtual 
machine program shown in Fig, 27 is stored. 

As shown in Figs. 86A to 8 6C, the decoded instruction 
sequence storing areas 3956ar3956d provided in the 
instruction block storing areas 3952a~3952d further include 
real machine code areas 8607a-8 607c for storing the decoded 
instruction sequences and the flag areas 8605a-8605c for 
storing flags that respectively show whether a decoded 
instruction sequence is stored in the real machine code 
areas 8607a-8607c. As one example, the instruction block 
storing area 3952b shown in Fig. 86B does not have a decoded 
instruction sequence in the re£l machine code area 8607b f so 
that flag ("empty") showing ari indication of this is stored 
in the flag area 8605b. On the other hand, the instruction 
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block storing area 3952c shown in Fig, 86(c) has a decoded 
instruction sequence in the real machine code area 8607c, so 
that a flag ("present") showing an indication of this is 
stored in the flag area 8605c, 
5 Note that the decoded instruction sequence that 

should u r stored in each real machine code area can obtained 
in advance, such as by using the virtual machine 3800 of the 
fifth embodiment ♦ This is because the decoded instruction 
sequence is the same as the decoded data sequence outputted 
.00 by the decoding unit 3802 to the executing unit 3810 when 

the virtual machine 3800 o¥\ the fifth embodiment executes 
the virtual machine program in each instruction block. 
^ In each instruction block, the separate virtual 

machine instructions located in the virtual machine code 
€315 areas 3954a~3954d, 3955a-3955d and the corresponding decoded 

;g data located in the real machine code area 8607a-8 607d are 

^ arranged at positions with addresses that are separated by a 

© predetoAuLacd offset. 

The current flag storing unit 3907 is a temporary 
20 storage area that holds a flag that is stored in the flag 

area of the instruction block in the instruction storing 
unit 3901 that is currently being executed by the virtual 
machine 3900. 

The instruction reading unit 3903 reads a virtual 
25 machine instruction or decoded data from the instruction 

storing unit 3901, based on the value of the flag held by 
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the current flag storing unit 3907, and outputs the read 
item to the search unit 4405 or executing unit 3910. This 
means t^?+- when decoded data is read, the search unit 4405 
is bypassed, so that the decoded data is sent directly to 
the executing unit 3910. 

The current flag read control unit 3912 checks 
whether the decoded data sent from the decoding unit 3902 is 
a branch instruction. If so, the current flag read control 
unit 3912 controls the decoding unit 3902 immediately after 
the branch instruction is executed, so that flag stored in 
the flag area of the branc& ^destination instruction block is 
read and stored in the current flag storing unit 3907. 

Operation of Virtual Machine 

following describes the operation of the virtual 
machine 3900. 

Fig. 87 is a flowchart showing the operation of the 
decoding unit 3902. 

The instruction reading unit 3903 of the decoding 
unit 3902 is instructed by the executing unit 3910 via the 
signal line R to read a next virtual machine instruction 
(steps 8702, 8703) . The instruction reading unit 3903 then 
reads the flag held by the current flag storing unit 3907 
and judges its content (step 8704) . 

On judging that a decoded instruction sequence is not 
included, the instruction reading unit 3903 operates in the 
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same way as in the fifth embodiment. The instruction 
reading unit 3903 reads the virtual machine code stored in 
the branch instruction storing area or non-branch 
instruction storing area in accordance with the address in 
the virtual machine code area that is stored in the PC 3804, 
and passes the read virtual machine code over to the search 
unit 4405 (steps 8705, 8706) * Next, the search unit 4405 
specifies the jump address by referring to the decode table 
44 06, and outputs the jump address to the executing unit 
3910 as decoded data (step 8707) , before sending 
notification of this on th©': signal line R (step 8711) . 

pT> the other hand, on judging from the current flag 
that a decoded instruction sequence is included, the 
instruction reading unit 3903 calculates an address in the 
real machine code areas 8607a~8607d by adding the 
predetermined offset to the address in the virtual machine 
code area stored in the PC 3804 (step 8708) . The 
instruction reading unit 3903 then reads the decoded data in 
accordance with the calculated address (step 8709) and 
outputs the read decoded data directly to the executing unit 
3910 (step 8710) - 

Fig. 88 is a flowchart Showing the operation of the 
executing unit 3910. 

^ig. 88 has fundamentally the same flow as the 
conventional art shown in Fig/ 9. The PC 3804 and SP 4412 
are initialized (step 8802), and then the microprogram in 
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the microprogram storing unit 4411 is executed based on the 
decoded data sent from the decoding unit 3902 (steps 
8804-8808) . 

The difference with Fig. 9 lies in the addition of 
the processing that involves the current flag storing unit 
3907 (step 8803) - On starting its operation, the executing 
unit 3910 stores a flag showing that no decoded data 
sequence is present into the current flag storing unit 3907 
to initialize the value of the current flag (step 8803) . 

Fig* 89 is a flowchart showing the control performed 
for the decoding unit 3902*when the executing unit 3910 
executes a branch instruction. As can be understood by 
comparing Fig. 89 with Fig. 75, when the executing unit 3910 
executes a branch instruction, the branch destination 
converting unit 3811 converts the operand of the branch 
instruction into an identifier segment of the branch 
destination instruction block and initializes the offset. 
The branch destination converting unit 3811 stores. this 
identifier segment and updated offset respectively into the 
identifier segment register 3804a and the offset counter 
3804b of the PC 3804 (steps 8902, 8903), though this 
processing is same as in the f i'f th embodiment . 

The difference with the fifth embodiment lies again 
in the addition of the processing that involves the current 
flag storing unit 3907 (step S904) . After the PC 3804 has 
been updated by the branch destination converting unit 3811 
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(steps 8902, 8903), the current flag read control unit 3912 
controls- the instruction reading unit 3903 30 as to read the 
value of the flag area in the instruction block shown by the 
identifier segment stored in the identifier segment register 
3804a and store the read value into the current flag storing 
unit 3907 (step 8904) . As a result, when a branch is 
performed to a new instruction block, the content of the 
current flag storing unit 3907 is updated, with a flag 
showing whether a decoded instruction sequence is stored in 
the real machine code area of the instruction block to be 
executed next being set in^the current flag storing unit 
3907. 

As described above, the virtual machine 3900 of the 
present embodiment divides a virtual machine program to be 
executed into instruction blocks that are generated from 
basic blocks* These instruction blocks are stored in the 
instruction storing unit 3901. However, instruction blocks 
do not just include virtual machine instructions, and so may 
also include decoded data that corresponds to the virtual 
machine instructions. The decoding unit 3902 refers to the 
flag area in each instruction block and, when decoded data 
exists for an instruction block, only needs to read the 
decoded data and pass it on to the executing unit 3910. When 
this happens, the search unit ,3405 does not need to search 
the search table. In addition to the effects achieved by 
the virtual machine 3800 of the fifth embodiment, the 



120 



present virtual machine 3900 can execute the instruction 
blocks that already include decoded data in a shorter time. 

Note that in the present embodiment, the virtual 
machine code area and real machine area in each instruction 
block were described as having a positional relationship 
whereby corresponding addresses are separated by a 
predetermined offset, although this need not be the case. 
As one example, the limitations of this positional 
relationship can be removed by providing each instruction 
block with an offset address for specifying the first 
address in the decoded instruction sequence storing area. 
When such offset addresses are provided, the flag and offset 
address of the instruction block can be read whenever a 
branch is performed to a new instruction block. In this 
way, addresses that respectively suit the virtual machine 
code area and real machine cpde area can be set in the PC 
3804 in accordance with the current flag- 

Seventh Embodiment 

The following describes the virtual machine 4000 of 
the seventh embodiment of the present invention. This 
virtual machine 4 000 dynamically generates the decoded 
instruction sequences for the virtual machine of the sixth 
embodiment . 

Construction of the virtual Machine 
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Fig. 90 is a block diagram showing the construction 
of the virtual machine 4000 in this seventh embodiment. 
This virtual machine 4000 includes an instruction storing 
unit 3901, a decoding unit 4002, an executing unit 3910., and 
a stack 4420. 

As can be seen by comparing Fig. 90 with Fig. 85, the 
present virtual machine 4000 has almost the same 
construction as the virtual machine 3900 of the sixth 
embodiment* The differences between the two lie in the 
provision of the decoded instruction sequence writing unit 
4008 in the decoding unit 1002 and in the accompanying 
changes to the internal wiring of the decoding unit 4002. 
The following explanation focuses on these differences 
between the present virtual machine 4000 and the virtual 
machine 3900 of the sixth embodiment. 

The decoded instruction sequence writing unit 4008 
operates as follows. When execution control by the present 
virtual machine 4000 has branched to an instruction block 
that does not have a decoded instruction sequence/ the 
decoded instruction sequence writing unit 4008 halts the 
execution of the instruction block and then has the entire 
virtual machine program located in that instruction block 
converted into a decoded instruction sequence by the 
instruction reading unit 3903 /and the search unit 4405. The 
decoded instruction sequence writing unit 4008 then writes 
the decoded instruction sequence into decoded instruction 
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sequence storing area of that instruction block. After 
this, the decoded instruction sequence writing unit 4008 has 
the reading by the instruction reading unit 3903 and 
executing by the executing unit 3910 recommenced for the 
decoded instruction sequence it has written. 

As a result, only decoded data that has been read 
from the instruction storing unit 3901 by the instruction 
reading unit 3903 is passed over to the executing unit 3910 
without amendment. Decoded data that is obtained by the 
search unit 4405 searching the decode table 4406 is not 
directly passed over to thfe- executing unit 3910. This 
differs from the sixth embodiment, and corresponds to the 
decoded data being sent from the search unit 4405 not to the 
executing unit 3910 but to the decoded instruction sequence 
writing unit 4008. 

Operation of the Virtual Machine 

The following describes the operation of the present 
virtual machine 4000. 

Fig. 91 is a flowchart showing the characteristic 
operation of the virtual machine 4 000 when executing a 
branch instruction. This characteristic operation is the 
operation of the decoded instruction sequence writing unit 
4008, the current flag read control unit 3912, and the 
branch destination converting 'unit 3811 • When branching to 
a new instruction block, the updating the value of the PC 
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3804 by the branch destination converting unit 3811 (steps 
9102,9103) and the updating of the content of the current 
flag storing unit 3907 by the current flag read control unit 
3912 use the same procedures as the sixth embodiment shown 
in Fig. 89, The difference between the present embodiment 
and the sixth embodiment lies in the subsequent generation 
and writing in the instruction storing unit 3901 of a 
decoded instruction sequence by the decoded instruction 
sequence writing unit 4008 (steps 9105-9111)* 

In more detail, the decoded instruction sequence 
writing unit 4008 receives* and refers to the flag that has 
been read by the instruction reading unit 3903 to judge 
whether a decoded data sequence has already been stored for 
the present instruction block (step 9105) - 

On finding that a decoded instruction sequence 
exists, the decoded instruction sequence writing unit 4008 
performs no particular processing (step 9112) . When this is 
the case, the decoded instruction sequence in present block 
is read 'out in order and is directly executed by the 
executing unit 3910. 

On the other hand, when no decoded instruction 
sequence exists, the decoded instruction sequence writing 
unit 4008 increments the pointer dPC (steps 9106-9111) while 
having the instruction reading"'unit 3903 successively read 
the virtual machine codes in the present instruction block 
(steps 9108, 9109) and having the search unit 4405 convert 
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the read virtual machine codes into decoded data with the 
required jump addresses. Here, the decoded instruction 
sequence writing unit 4008 writes the resulting decoded data 
into th<=\ decoded instruction sequence storing area of the 

5 present instruction block (step 9110) . 

Once the converting into decoded data and writing has 
been completed for all of the virtual machine code in the 
present block (step 9107), the decoded instruction sequence 
writing unit 4008 writes a flag showing a decoded data 

10 sequence exists into the current flag storing unit 3907 and 

into the flag area of the present instruction block and 
thereby completes its processing (step 9112) . As a result, 
the reading by the instruction reading unit 3903 and the 
executing by the executing unit 3910 can recommence for the 

15 decoded instruction sequence of the instruction block. 

•rig. 92 is a flowchart showing the details of the 
processing in step 9110 of Fig. 91, which is to say, the 
conversion from virtual machine code into decoded data and 
the storage in instruction storing unit 3901. As can be 

20 seen by comparing Fig. 92 with Fig. 7, the present 

processing is composed of the processing of the conventional 
search unit 4405 plus the processing by the decoded 
instruction sequence writing unit 4008. This processing by 
the decoded instruction sequence writing unit 4008 writes 

25 the jump addresses da obtained by searches of the decode 

table 4406 and the operands of virtual machine instructions 
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into the instruction storing unit 3901 as decoded data 
(steps 9209, 9213) . 

Fig. 93 is a flowchart showing the operation of the 
decoding unit 4002 when viewed from the executing unit 3910* 
5 The instruction reading unit 3903 only passes decoded data 

read from a real machine code area of the instruction 
storing unit 3901 to the executing unit 3910, and so from 
its relation with the executing unit 3910 can be said to 
function as a specialized reading unit for decoded data. 

10 As described above, when a branch has been performed 

to an instruction block thfo'fc does not have a decoded 
instruction sequence, the virtual machine 4000 of the 
present embodiment first has the virtual machine code in 
that instruction block converted into decoded data that is 

15 written into the instruction storing unit 3901, with this 

decoded data then being directly executed. As a result, 
when this execution block is next executed, the same decoded 
data can be read and directly executed, so that the time 
taken for decoding, which is to say, the time taken by the 

20 search unit 4405 to search the decode table 4406, can be 

saved. The resulting increase in execution speed is 
especially pronounced when a s^me instruction block is 
repeatedly executed, such as for a loop process. 

25 Eighth ^odiment 

The following describes the virtual machine 4100 of 
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the eighth embodiment. This virtual machine 4100 is similar 
to the virtual machine of the seventh embodiment, but uses 
data compression. 

5 Construction of the Virtual Machine 

Fig. 94 is a block diagram showing the construction 
of the virtual machine 4100 in this seventh embodiment. 
This virtuaJ machine 4100 includes an instruction storing 
unit 4101, a decoding unit 4102, an executing unit 3910, and 

10 a stack 4420. 

As can be seen by Comparing Fig. 94 with Fig. 90, the 
present virtual machine 4100 has almost the same 
construction as the virtual machine 3900 of the sixth 
embodiment. The differences between the two lie in the 

15 code format of the virtual machine program stored in the 

instruction storing unit 4101, in the provision of the 
restoring information storing areas 4157a-4157d in the 
instruction storing unit 4101, and in the addition of the 
virtual machine instruction restoring unit 4103a to the 

20 instruction reading unit 4103 of the decoding unit 4102. 

The following explanation focuses on these differences 
betwee- present virtual machine 4100 and the virtual 

machine 4000 of the seventh embodiment. 

The branch instruction^storing areas 4154a-4154d and 

25 non-branch instruction storing areas 4155a-4155d (hereafter 

collectively called the "compressed virtual machine code 
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areas) of the instruction storing unit 4101 store compressed 
virtual machine instructions in advance- The restoring 
information storing areas 4157a-4157d of the instruction 
storing unit 4101 each store a decompression table for 
decompressing the compressed virtual machine instructions 
that are stored in the corresponding instruction block. 

Fig. 95A shows an example of a decompression table. 
This table includes numerous pairs of a compressed bit 
sequence and the corresponding virtual machine instruction. 

Fig. 95B shows the rules governing codes in the 
decompression table shown §Lri Fig. 95A. In this embodiment, 
single virtual machine instructions including operands are 
compressed into bit sequences according to a bit compression 
method based on Huffman coding. As one example, the bit 
sequence "000" represents the virtual machine instruction 
"Push [0]", while the bit sequence "01010" represents the 
virtual machine instruction "Push 10". 

Figs, 96A-96C show examples of the stored state of a 
virtual machine program that is stored in the instruction 
storing unit 4101. This virtual machine program is 
equivalent to the sample virtual machine program shown in 
Fig. 27. The compressed virtual machine code areas 
4158a-4158c, composed of the non-branch instruction storing 
areas 4l54a-4154c and the branch instruction storing areas 
4155a-4155c, in the instruction block storing areas 
4152a-4152c respectively store bit sequences (hereafter, 
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"compressed bit sequences") that are obtained by compression 
encoding the virtual machine program in the corresponding 
instruction block and linking the results into sequences. 
Each restoring information storing area 4157a~4157c stores a 
5 decompression table for decompressing the bit sequences in 
the corresponding compressed virtual machine code areas 
4158a~4158c. Note that Fig. 96B shows the instruction block 
storing area 4152b that does not have a decoded instruction 
sequence, while Fig. 96C shows the instruction block storing 

10 area 4152c that has a decoded instruction sequence. 

The instruction reading unit 4103 has the same 
functions as the instruction reading unit 3903 of the 
seventh embodiment, which is to say the instruction reading 
unit 410 3 reads compressed bit sequences from the compressed 

15 virtual machine code areas 4158a~4158d in the instruction 

storing -unit 4101 and reads decoded instruction sequences 
from the decoded instruction sequence storing areas 
4156a~4156d, However, the instruction reading unit 4103 is 
also provided with a virtual machine instruction restoring 

20 unit 4103a. 

The virtual machine instruction restoring unit 4103a 
operates as follows. When the "instruction reading unit 4103 
reads one bit at a time in a compressed bit sequence from 
one of the compressed virtual machine code areas 4158a~4158d 
25 in the instruction storing unit 4101, the virtual machine 

instruction restoring unit 4103a refers to a decompression 
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table stored in the corresponding restoring information 
storing area 4157a*-4157d and specifies the virtual machine 
instruction that corresponds to the read compressed bit 
sequence. The virtual machine instruction restoring unit 
5 4103a then passes this virtual machine instruction on to the 

search unit 4405, These processes compose the decompression 
(restoring) processing that is repeated by the virtual 
machine instruction restoring unit 4103a. 

10 Operation of the Virtual Machine 

The following describes the operation of the present 
virtual machine 4100. 

As mentioned above, the present virtual machine 4100 
includes all of the functions of the virtual machine 4000 of 

15 the seventh embodiment, so that the overall processing by 

the virtual machine 4100 is ^the same except for the 
decompression of the compressed bit sequences. Accordingly, 
the processing of the virtual machine 4100 is the same as 
that shown by the flowchart in Fig. 91. 

20 The present virtual machine 4100 operates in Lhe same 

way as the virtual machine 4 000 in the seventh embodiment 
when there is a branch to an instruction block that does not 
have a decoded instruction sequence. The instruction 
reading unit 4103 and search unit 4405 first convert the 

25 virtual machine program in this instruction block into 

decoded data which is written into the instruction storing 
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unit 4101 by the decoded instruction sequence writing unit 

4008. After this, the resulting decoded instruction 

sequence is read by the instruction reading unit 4103 and 

directly executed by the executing unit 3910. 

5 The virtual machine 4100 of the present embodiment 

differs from the virtual machine 4000 in that it reads 

virtual machine instructions that have been compressed. As 

a result, the detailed processing in steps 9109 and 9110 of 

Fig. 91 differs from the processing in the seventh 

hflO embodiment. This is because a suitable read process must be 

perf oriufco. for the compressed bit sequences and a 

jip decompression process must be additionally performed. 

Fig. 97 is a flowchart showing the detailed 

''7" processing of steps 9109 and 9110 in the Fig. 91 for this 

^15 eighth embodiment. This processing is performed by the 

O decoding unit 4102 of the virtual machine 4100, Here, steps 

9602 and 9603-9616 in Fig. 97 respectively correspond to 

^ steps 9109 and 9110 in Fig. 91, 

As can be understood by comparing Fig. 97 with Fig. 

20 92 that shows the operation in the seventh embodiment, the 

differences between the two are as follows. First, instead 

of reading the virtual machine 'code directly, the present 
i 

embodiment reads compressed bit sequences and performs 
decoding (step 9602), Second*" operands (the patterns op[i]) 
are also obtained as necessary during the decoding (step 
9602)/ so that instead of reading the operands from the 
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instruction storing unit 4101, the present embodiment writes 
these operands (the patterns op[i]) into the decoded 
instruction sequence storing areas 4156a-4156d (step 9613) , 
Fig. 98 is a flowchart showing the details of step 9602 in 
5 Fig. 97. The instruction reading unit 4103 first reserves a 

temporary storage area (the variable bits) for the 
compressed bit sequences (step 9702) / and then reads one bit 
of compressed code from one of the compressed virtual 
machine code areas 4158a~4158d in one of the instruction 
J^plO block storing areas 4152a~4152d that does not have a decoded 

^ instruction sequence (step5 9703) . The instruction reading 

■CP unit 4103 links this read bit with the compressed codes (the 

■m variable bits) that it has already read (step 9704) . 

7 The virtual machine instruction restoring unit 4103a 

j p:l5 compares the compressed code (the variable bits) obtained in 

^ff step 9704 in order with each : compressed code sequence 

«£p registered in the decoding table in a restoring information 

storing area 4157a-4157d that starts from an address given 
by adding a predetermined offset to the value of the PC 
20 3804, and so specifies the matching virtual machine 

instruction (step 9705). This reading (step 9703) and 
search (step 9705) are repeated until a matching virtual 
machine instruction is found (step 9706). 

When a matching virtual machine instruction has been 
25 found, the virtual machine instruction restoring unit 4103a 

reads that virtual machine instruction from that restoring 
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information storing area 4157a-4157d (step 9707) and outputs 
the virtual machine instruction to the search unit 4405, 
having separated the virtual machine instruction into an 
opcode and operand (the pattern op[]) when such operand 
5 exists (^teps 9708, 9709) . After this, the search unit 4405 

converts the virtual machine instruction into the 
corresponding decoded data, as shown in steps 9603-9614 in 
Fig. 97, and the decoded instruction sequence writing unit 
4 008 writes this decoded data with the operand pattern op[] 
3° if necessary into the real machine code area of the 
^ corresponding instruction Block. In this way, the virtual 

jjj^ machine 4100 of the present embodiment arranges a compressed 

CP virtual machine program into each instruction block in the 

instruction storing unit 4101, so that when there is a 
j3 5 branch to an instruction block that does not have a decoded 

;jrf instruction sequence, the victual machine 4100 first 

*Dj decompresses the compressed virtual machine program in that 

instruction block; converts it into decoded data, and writes 
the decoded data into the instruction storing unit 4101 so 
20 that the decoded data can then be directly executed. 

As a result, the virtual machine 4100 of the present 
embodiment guarantees that each. compressed bit sequence will 
always be decoded starting from the start of an instruction 
block, which is to say, from the start of a complete 
25 instruction. As a result, the problems caused when the 

execution of a branch instruction leads to decoding being 
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mistakenly performed starting midway through a compressed 
bit sequence can be completely avoided. In this way, the 
present embodiment realizes a virtual machine that can 
correctly execute virtual machine programs that have been 
compressed. 

Note that while the instruction block storing areas 
4152a-4152d in the instruction storing unit 4101 of the 
present embodiment are provided with decoded instruction 
sequence storing areas 4156a-4156d, provided that the 
conventional problem of failing to decode a compressed bit 
sequence from its start car? still be avoided/ these decoded 
instruction sequence storing areas 4156a-4l56d may be 
omitted. 

^his is to say, the virtual machine 4100 of the 
present embodiment was described as corresponding to the 
virtual machine 4000 of the seventh embodiment, which 
includes the decoded instruction sequence storing areas 
4156a~4156d, but having a further function of being- able to 
decode and execute virtual machine programs that have been 
compressed. However, it is also possible to achieve a 
virtual machine that corresponds to the virtual machine 
3800, which does not have decoded instruction sequence 
storing areas 4156a-4156d, but is capable of decoding and 
executing virtual machine programs that have been 
compressed. in either case, the compressed virtual machine 
program is stored in units of instruction blocks based on 
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basic blocks, and the branch destination of every branch 
instruction is guaranteed to be the first instruction in an 
instruction block- This means that compressed bit sequences 
will not be mistakenly decoded starting midway through. 

Note that while the present embodiment uses Huffman 
coding bo compress the virtual machine instruction, it 
should be obvious that LZ methods or other compression 
techniques may be used. 

Ninth Embodiment 

The following explains the JIT compiler that is a 
ninth embodiment of the present invention. This JIT 
compiler can quickly generate real machine code that 
satisfies the boundary restrictions relating to jump 
destinations in the target real machine 

Construction of the Compiler System 

Fig. 99 is a functional block diagram showing the 
entire JTT compiler 4300 of the present embodiment. This 
figure shows not only JIT compiler 4300, but also the 
virtual machine compiler 4320 that generates the information 
that needs to be inputted into the JIT compiler 4300. 

The virtual machine compiler 4320 is equipped with 
language conversion functions that are provided in a 
standard compiler, which means that it receives an input of 
a source program written in a high-level language like n C", 
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generates virtual machine codes for a specified virtual 
machine, and outputs the resulting virtual machine codes to 
circuit Dl. However, the virtual machine compiler 4320 is 
further equipped with a block start information generating 
5 unit 4321a that generates special information (the block 

start information) that is required by the JIT compiler 4300 
and outputs this special information to the circuit D2. 

The block start information generating unit 4321a is 
a function that is additionally provided in an output unit 
4321 of a standard compiler, which is to say, an output unit 
;;f 4321 that sequentially outputs virtual machine codes, which 

ffl are finally obtained after syntactic analysis and conversion 

into intermediate code, to the periphery- This block start 
J*~ information generating unit 4321a judges whether each 

]35 virtual machine code outputted from the output unit 4 321 to 

the circuit Dl should be made the start of a basic block, 
: # and outputs the block start information that shows the 

results of these judgments to the circuit D2. 

The JIT compiler 4300 receives an input of the 
20 virtual machine codes and the block start information 

generated by the virtual machine compiler 4320, and converts 
the virtual machine codes into a real machine instruction 
sequence 4311 for a real machine that has a restriction 
whereby the branch destination's of real machine instructions 
25 are based on the two-word alignment in the address space - 

This JIT compiler 4300 includes a real machine instruction 
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converting unit 4301, a branch position amending unit 4302 , 
and a real machine address storing unit 4303. 

The real machine instruction converting unit 4301 
operates as follows. When a virtual machine code outputted 
from the virtual machine compiler 4320 via the circuit Dl is 
an opcode/ the real machine instruction converting unit 4301 
converts the virtual machine code into the corresponding 
real machine code based on an internal conversion table. On 
the other hand, when a virtual machine code is an operand, 
the real machine instruction converting unit 4301 outputs 
the operand as it is to thfe branch position amending unit 
4302. When doing so, the real machine instruction 
converting unit 4301 reads the real machine address PC 
stored by the real machine address storing unit 4303 and 
outputs it together with the real machine code to the branch 
position amending unit 4302, "before updating the real 
machine address PC. 

The real machine address storing unit 4303 stores a 
relative address PC in the real machine space at which the 
next real machine code to be generated should be placed in 
the real machine instruction converting unit 4301. 

The branch position amending unit 4302 judges whether 
the real machine instruction at the start of a basic block 
is positioned at an odd-numbered address, based on the real 
machine address PC sent from the real machine instruction 
converting unit 4301 and the block start information 
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outputted from the virtual machine compiler 4320 via the 
circuit D2. This is to say, the branch position amending 
unit 43C? judges whether this starting real machine 
instruction violates the restriction concerning the two-word 
alignment in the address space. If the address violates 
this restriction, the branch position amending unit 4 302 
inserts a one-word dummy instruction, which is to say, a no- 
operation instruction "Nop n in front of the instruction, 
before outputting the real machine code send from the real 
machine instruction converting unit 4301 to the periphery as 
part of the real machine instruction sequence 4311. By 
doing so, the branch position amending unit 4302 arranges 
the effective start of the basic block at, an address 
complying with the two-word alignment without affecting the 
processing content of the program. 

Ope rati oh of the Compiler System 

The following is an explanation of a compiler system 
of the above construction, focusing on the differences with 
a standard compiler. 

Fig. 100 is a flowchart showing the operation of the 
block start information generating unit 4321a of the virtual 
machine compiler 4320. This flowchart has fundamentally the 
same flow as the operation of the virtual machine compiler 
of the fifth embodiment that was shown in Fig. 80. 

First, the block start information generating unit 
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4321a judges whether each virtual machine code that the 
output unit 4321 is trying to output should be made the 
start of a basic block (steps 10003, 10004) . The block 
start information generating unit 4321a outputs block start 
fi information "T" on judging that a virtual machine code 

should be made the start of a basic block, (step 10006)/ or 
otherwise outputs the block start information "N" (step 
10005) . The block start information generating unit 4321a 
outputs the block start information "T" or "N" to the 

*pL0 circuit Dl and the virtual machine code VC to circuit D2 

jJn (step 10007) . " " ' 

^ rig. 101 is a flowchart showing the operation of the 

ft real machine instruction converting unit 4301 , the branch 

position amending unit 4302, and the real machine address 

-IKE?" 

,pl5 storing unit 4303. First, the real machine address storing 

$fg unit 4303 initializes the real machine address PC (step 

J 10102). 

The real machine instruction converting unit 4 301 
receives the virtual machine code VC outputted by the block 

20 start information generating unit 4321a (steps 10103,10304), 

converts the virtual machine code VC into a corresponding 
real machine code as necessary, * and transfers this to the 
branch position amending unit 4 302 together with the real 
machine address PC read from the real machine address 

25 storing unit 4303- After this, the real machine instruction 

converting unit 4301 increments the real machine address PC 
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(step 10105) . 

Following this, the branch position amending unit 
4302 receives the block start information "BI" corresponding 
to the virtual machine code VC from the block start 
5 information generating unit 4321a (step 10106) and, when 

outputting the real machine code received from the real 
machine instruction converting unit 4301, judges whether the 
virtual machine code will cause a violation of the boundary 
restrictions described earlier (steps 10107, 10108) . 
r =jji0 Specifically, the branch position amending unit 4302 judges 

\jp whether the block start information BX received from the 

^ block start information generating unit 4321a is "T" showing 

the start of a basic block and the real machine address PC 
received from the real machine instruction converting unit 
X-15 4301 violates the two-word alignment restriction (steps 

ij 10107, 10108). 

On judging that a virtual machine code VC should be 
made the start of a basic block and that the real machine 
address PC violates the two-word alignment restriction, the 
20 branch position amending unit 4302 generates and outputs a 
real machine instruction "Nop", ..before outputting the 
aforementioned real machine instruction as part of the real 
machine instruction sequence 4311 (steps 10109, 10110) . 
Note that whenever the branch position amending unit- 4302 
25 generates "Nop" real machine instructions (step 10110), it 

also updates the real machine address PC in the real machine 
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address storing unit 4303 accordingly* 

The processing in steps 10104-10110 described above 
is repeated while virtual machine codes are transferred from 
the block start information generating unit 4321a (steps 
5 10103, 10111), 

Fig. 102 is a table showing the block start 
information generated by the block start information 
generating unit 4321a, the timing of the generation of "Nop" 
real machine instructions by the branch position amending 
;g) unit 4302, and other related information, for a case when 

the sample virtual machine -instruction sequence shown in 
!jfj Fi g- 27 is inputted into the JIT compiler 4300* As can be 

41 J seen from Fig. 102, the virtual machine instructions at the 

virtual machine addresses 0, 8, 15, and 31 are each set as 
|5 the start of a basic block, so that the block start 

:;f information "T" is generated for these instructions. 

% When processing the virtual machine address 15, the 

branch position amending unit 4302 receives the block start 
information "T n from the block start information generating 
20 unit 4321a and an odd number (35). as the real machine 

address PC from the real machine instruction converting unit 
4301. Before outputting the virtual machine instruction 
corresponding to the virtual machine instruction "Push[l] TT , 
the branch position amending unit 4302 outputs a "Nop" 
25 instruction. As a result, cases where the first instruction 

in a block is located at an odd-numbered address can be 
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avoided. 

With the JIT compiler 4300 of the present embodiment, 
the analysis of the branch destinations of branch 
instructions does not require the complicated procedure that 
was conventionally necessary. As a result, the JIT compiler 
4300 can generate real machine programs that do not violate 
the boundary restrictions for jump destinations. This is 
because the block start information generating unit 4321a in 
the virtual machine compiler 4320 detects the basic blocks 
and informs the JIT compiler 4300 of the block start 
information. - - 

Compared to a conventional JIT compiler 4300, the JIT 
compiler 4300 of the present invention can eradicate the 
problems regarding boundary restrictions by merely adding 
"Nop" virtual machine instructions based on the block start 
information. As a result, the present embodiment realizes a 
JIT compiler that generates suitable real machine code where 
the jump destinations of jump instructions do not violate ' 
the boundary restrictions. 

Note that while the block start information 
generating unit 4321a of the present embodiment is provided 
as an additional feature of the .output unit 4321 of the 
virtual machine compiler 4320, this may be replaced with a 
procedure for dividing into basic blocks that is provided in 
a standard compiler. As part of optimization, a standard 
compiler will divide a program into basic blocks, so that by 
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outputting block start information obtained during this 
block division procedure to the periphery (the JIT compiler 
4300) , a block start information generating unit 4321a can 
be easily realized. 

In this ninth embodiment, n Nop" instructions are used 
as the no-opera Lion instructions, although such no operation 
instructions do not need to be explicit. As one example, 
instructions that add zero to the value of a register may be 
used instead. 

Also in the present embodiment, alignment processing 
is only performed when positioning real machine instructions 
that are jump destinations, although it should be obvious 
that other instructions may also be rearranged In the same 
way when there is a delayed branch or a canceling branch. 
This means that by merely arranging a required number of no- 
operation instructions at the' start of a basic block, it can 
be guaranteed that delayed branches will be properly 
performed- This is because when basic blocks are arranged 
into memory with no intervals between them, the branch 
instruction that is located at the end of each basic block 
will definitely be linked to the^ required number of no- 
operation instructions, so that -erroneous operations due to 
a delayed branch are avoided. 

The virtual machine, virtual machine compiler, and 
JIT compiler of the present invention have been described by 
way of the first-ninth embodiments, although the present 
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invention is not limited to these embodiments* The 
characteristic components of each embodiment may be combined 
or partially integrated into other embodiments , so that a 
variety of variations of the present invention may be 
realized. 

As one example, by combining the first and fifth 
embodiments, the virtual machine program can be divided into 
basic blocks and stored into an instruction storing unit 
together with the corresponding next instruction 
information. This realizes a high-speed virtual machine 
that removes true data dependency and simplifies the address 
processing by a JIT compiler. 

In the same way, combining the second and eighth 
embodiments realizes an interrupt-handling virtual machine 
that only performs sufficient interrupt handling and 
executes compressed bit sequences for which proper decoding 
is guaranteed- 

In the first embodiment, the next instruction 
information and virtual machine instructions have a separate 
structure to the block start information and virtual machine 
instructions in the ninth embodiment. As shown in Fig. 103, 
however, the virtual machine instructions executed by the 
virtual machine of the present invention may be defined as 
extended virtual machine instructions that have such next 
instruction information and block start information 
embedded. In such a case, by routinely branching after a 
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read has been performed from an instruction storing unit in 
units of extended virtual machine instructions, the next 
instruction information, block start information, and opcode 
and operand (s) of the virtual machine can be distinguished 
and separately obtained. 

In the fifth-eighth embodiments, each instruction 
block storing unit was given a unique identifier, although 
should identifiers do not need to be used if each 
instruction block can be separately identified, such as when 
each instruction block is arranged in an instruction block 
storing unit according to certain rules* 

The virtual machine, virtual machine compiler, and 
JIT compiler of the present invention can each be realized 
by a program that is executed by a standard personal 
computer. It should be obvious that such programs may be 
distributed having been recorded onto a storage medium such 
as CD-ROM or by being transmitted via communication lines. 

Although the present invention has been fully 
described by way of examples with reference to accompanying 
drawings, it is to be noted that various changes and 
modifications will be apparent to those skilled in the art. 
Therefore, unless such changes and modifications depart from 
the scope of the present invention, they should be construed 
as being included therein. 
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